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BACKGROUND OF THE INVENTION 
A variety of biochemical processes, particularly those involving protein-protein 

* — . 

interactions, are believed to be mediated by an induced conformational change in the protein 
target. The resulting structural change in the protein is then used to explain a modification in 
its function (e.g., enzymatic activity) or its affinity for another protein in the biological 
system. Conformational change has been proposed to occur in the cascade of steps associated 
with certain signal transduction pathways in eukaryotic cells. A ubiquitous component of 
such signal transduction pathways is a heterotrimeric guanine nucleotide-binding protein (G- 
protein) coupied to a cell surface receptor (for reviews see references 1-4 and 72). G-proteins 
relay signals initiated by various stimuli including photons, odorants, and a number of 
hormones and neurotransmitters. A variety of diseases are caused by defects in G-protein 
activity. G-proteins exist as heterotrimeric complexes of a, p, and y subunits. The a-subunit 
(Ga) is weakly bound to a dimer (Gpy) in which the (3-subunit is tightly bound to the y- 
subunit. Ga is also associated with the intracellular carboxy terminal tail of a seven-helical 
transmembrane receptor. G-proteins transfer signals from more than 1000 receptors with 
various Ga subtypes regulating a variety of distinct downstream signaling pathways. 
Guanine nucleotide binding and GTPase function within the Ga domain to regulate the 

activity of G-proteins. 

The G-protein signaling process is typically initiated by the binding of an agonist to 
the cell surface receptor resulting in an induced conformational change in the G-protein. The 
G-protein structural change affects the guanine nucleotide affinity of Ga, so that it 
preferentially binds GTP and Mg 2+ over GDP. Numerous x-ray structures for G ial during the 
various stages of the GTPase cycle have been used to identify regions of induced 
conformational change (5-8). In particular, the Ga guanine nucleotide binding site is 
composed of three distinct "switch" regions: residues V179-V185 in switch I, residues Q204- 
H213 in switch II and residues A235-N237 in switch HI, which undergo conformational 
changes upon GTP hydrolysis. The Ga surface that binds the G(3y dimer contains switch I 
and switch II regions. In the active Ga-GTP-Mg 2+ complex, a conformational change in 
switch I is associated with binding Mg 2 *, and switch H and switch HI regions become well 
ordered due to ionic interactions between the two switch regions. As a result of the formation 
of the Ga-GTP-Mg 2 complex, modifications in the structure of the three "switch" regions 
facilitate dissociation of Ga from Gf3y. The released subunits are then available to interact 



with a variety of target proteins to elicit the desired response. Termination of the signal 
results when. the process is reversed by the hydrolysis of GTP bound to Get. Reassociation of 
Go^with GPy results in the inactivation of the G-protein. Therefore the duration of the G- 
protein signal is directly dependent on the GTPase activity of the Get protein. 

Regulators of G-protein signaling (RGS) affect the intensity and duration of the G- 
protein signal cascade by binding to the active Ga-GTP-Mg 2 complex and inducing a 50-fold 
increase in the rate of GTP hydrolysis (For reviews see references 9-13). Conversely, RGS 
proteins have little or no affinity for the inactive Gcc-GDP complex. Thus, RGS proteins act 
as attenuators of the induced G-protein signal by increasing the rate of inactivation of the G- 
protein and termination of the signal. RGS proteins may exhibit additional biological 
function, e.g., RGS4 is reported to block activation of GTP-Ga by effectors (83). The RGS 
family, including RGS4, GAIP (human Ga-interacting protein), RGS1, RGS 10, and RGS 16, 
among others, contains more than 20 known members where specificity for Ga subtypes has 
been demonstrated and appears to be associated with subtle sequence differences (8, 14). The 
.RGS family contains significant structural diversity, however, all RGS proteins are 
characterized by a conserved domain of about 130 amino acids which may be separated by 
linker regions of varying lengths. Recently, the RGS family has been reported to comprise at 
least six separate subfamilies designated A-F with unique structural features (Zheng, B. et al. 
(1999) (86), RGS4 exhibits structural features of RGS subfamily A. Subfamily-specific 
structural features may be associated with subfamily-specific functions, e.g., differences in 
Ga binding specificity among RGS proteins, membrane association of RGS protein, or 
functions exhibited by RGS proteins in addition to GAP activity. RGS4 is believed to 
function to attenuate induced G-protein by stabilizing the transition. 

RGS proteins are widely expressed in eukaryotic cells, including human cells (13). 
At least one RGS protein is found in tissue of each human organ and many tissues express 
multiple RGS proteins. Additionally, members of the RGS family are specifically expressed 
in the human brain, where RGS4 is perhaps the most widely distributed and highly expressed 
RGS subtype (15,1 6). RGS expression has been correlated with a response to induced 
seizures, which indicates that regulation of RGS expression is an adaptive response in the 
brain signal transduction pathway to compensate for desensitization and sensitization of G- 
protein-coupled receptor function (16). In addition to regulation of the response to 
neurotransmiters, RGS activity has been associated with a variety of cellular functions 



including: cell proliferation, cell differentiation, membrane trafficking and embryonic 
'development (9, 10, 12, 17). 

An x-ray structure of RGS4 bound to Gial [8], site-directed mutagenesis [18-20] and 
biochemical studies [17, 21] suggest that RGS4 binds preferentially to the Ga-GTP-Mg 2+ 
5 complex and stabilizes the transition state structure of the switch regions facilitating 

hydrolysis of GTP. Since the functional result of RGS4 binding to Gial is induction of GTP 
hydrolysis by Gial, it is reasonable to anticipate that the conformational change upon 
complex formation with RGS4 primarily occurs in G ial . However, the x-ray crystal structure 
of Gial in the RGS4-Gia 1 complex exhibits only a 0.6 A rms difference from that of Gial 
10 in Gial-AIF/ which is trapped in the proposed transition state for GTP hydrolysis. This 
comparison indicates that there is no significant conformational change in GiaL On the 
other hand, analysis of the RGS4-Gial complex x-ray structure indicates that RGS4 binding 
to Gial induces a decrease in the mobility of the switch regions of GiaL In these regions, 
yg critical interactions occur between N82 of RGS4 (employing the numbering of Figure 1) with 

]|5 the switch regions I and II of Gial and between T182 of Gial with a Gee binding pocket on 
j;; RGS4. The RGS4 residue N82 has been identified as critical for facilitating the intrinsic Gial 

111 GTPase activity presumably by stabilizing the switch regions and substrate binding (19, 20). 

'i T\ 

Z Similar changes in the switch regions are observed between the Ga-GTP-Mg 2 complex and 

the Ga-GDP complex (2), suggesting that a conformational change in RGS4 may contribute 
fpO to regulation of G-protein signaling. 

K de Alba, E. et al. (1 999) (87) reports the solution structure of human GAIP as 

H ; determined by NMR techniques. The structure calculation used dipolar couplings of the 

oriented protein in two different liquid crystal media. The GAIP solution structure was 
compared to that of the rat RGS4-G l a I x-ray structure (8). The reference suggests that GAIP- 
25 LI 87 participates in Ga-RGS binding and may also be important in the folding and stability 
of the RGS protein. It is also suggested that GAIP-S156 plays a role in GAIP stability. 

r 

GAIP-S156 has been identified as a subfamily-specific residue for the RGS subfamily A 
[GAIP, Ret-RGSl, RGS21] (86) and Wang et al. (89). In RGS subfamily B which includes 
RGS4, the core amino acid corresponding to GAIP SI 56 is RGS N82 (as numbered in Fig. 1 
30 and N128 as numbered in Tesmer et al. (8)). The core region of GAIP is reported to have 
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only 60% sequence identity to the core of RGS4. Any differences observed between these 
"twQ structures are at least in part due to the differences in amino acid sequence. 

It is thus desirable to provide structural information for free RGS4 to better 
understand the mechanism of the regulation of G-protein signaling. More specifically, such 
5 structural information allows a direct comparison between the solution structure of RGS4 and 
the x-ray structure of the RGS4-Gcc complex to determine which conformational changes 
occur in RGS proteins on binding to Get. The structural information and comparison can be 
employed to identify factors (chemical or biochemical species) that affect G- protein 
signaling by interaction with RGS proteins or their complexes with Ga. The structurally 
1 0 information can be of particular use in the identification and rational design of agonists and 
antagonists of free RGS and RGS/Ga complex activity. 

* SUMMARY OF THE INVENTION 
*□ The present invention provides the three-dimensional solution structure of a free (i.e., 

Jl 5 not complexed) RGS protein of subfamily B, specifically that of free RGS4, as determined by 
§ \ NMR (nuclear magnetic resonance) spectroscopy. Particularly, the invention provides the 

O three-dimensional solution structure of a Ga binding site of an RGS subfamily B protein, 

ffi The Ga binding site of RGS subfamily B is exemplified by the three-dimensional structure of 

- the RGS4 Giexl binding site comprising the RGS4 protein residues DI 17, SI 18 and R121. 

S20 The invention also provides the three-dimensional structure of the cc6 - <x7 region of a free 
RGS subclass B which region exhibits a significant conformational change on binding of 
K RGS to Ga. Binding at the cc6 - al region of RGS protein can effect the function of RGS 

^ protein in G-protein signaling. Further, the invention identifies and provided the three- 

dimensional structure of an allosteric binding site in an RGS protein. Binding at this 
25 allosteric site can affect the regulation of G-protein signaling. An allosteric binding site m 
the RGS protein is exemplified by the allosteric binding site in RGS4 located in the a 1 and 
cc2 helical regions of free RGS4 and in the tight turn located between the two helical regions. 
More specifically, the allosteric binding site in RGS4 comprises the residues V10, W13, LI 7, 
120, H23, E24, C25 and T132. 
30 The three dimensional structure of free RGS4 in solution, including the Ga binding 

site, the C-terminus a 6 - a 7 region of free RGS4, and the allosteric binding site in free RGS4 
are provided by the relative atomic structural coordinates given in Table 2 as obtained by 
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NMR spectroscopy. Also provided are the 15 N, i3 C, I3 CO and ! H NMR assignments for free 
RGS4 (Table 1) which are employed in the determination of its secondary and three- 
dimensional structure. These assignments are also useful in methods for identifying or 
detecting chemical and biochemical species that bind to RGS and which can affect RGS 
5 function or RGS-Ga function and are particularly useful for identifying or detecting species 
that bind to RGS subclass B which can affect its function or RGS subfamily B-Ga function. 

The invention further provides a representation or model of all or part of the three- 
dimensional structure of a free RGS subfamily B protein comprising a data set of relative 
atomic structural coordinates embodying the three-dimensional structure of free RGS4 
10 protein. The invention also provides a data set of relative atomic structural coordinates 

embodying the three-dimensional structure of the Ga binding site in an RGS subfamily B 
protein. The invention further provides a data set of relative atomic structural coordinates 
embodying the a 6 - a 7 region of an RGS subfamily protein. In addition, the invention 

0 provides a data set of relative atomic structural coordinates embodying an allosteric binding 
f5 site in an RGS subfamily B protein. The data set and any structural representation or model 

s:5: 

If! of a free RGS subfamily B, its Ga binding site its a 6 - a 7 region or the allosteric binding site 
Q in RGS subfamily B created or generated using the data set provided herein can be employed 
If; to identify, select or rationally design factors, e.g., chemical or biochemical species, which 

1 affect RGS function or activity or RGS/Ga complex activity or function. Further, the data 
ID set, structural representation or model of the Ga binding site can also be used to identify, 
'X select or rationally design species which affect Ga function by binding to Ga. The data set 
O and structural representations and models provided by this invention are particularly useful 

for the identification of agonists or antagonists of RGS function or RGS/Ga complex 
function or activity. 

25 The data set, including subsets of data embodying the Ga binding site, the a6 - a7 

region and the allosteric binding site, provided herein was determined by NMR analysis. 
However, any known method can be employed to provide the structural data. In one 
embodiment, the data set embodies the structure of free RGS4 in solution. In certain ^ 
embodiments, the data set comprises one or more portions, of the structure of free RGS4. Of 

30 particular interest are those portions of the structure of RSG4 which function in RSG- 

regulation of G-protein signaling or more specifically which affect binding of RGS to Ga or 
which affect the biological function or activity of the RGS-Ga complex. Also of interest are 
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those portions of the structure of RSG4 to which candidate agonists and antagonists of RSG 
bin&to affect its biological function. 

Any available method may be used to construct a structural representation or model 
from the NMR-derived data disclosed herein or from data obtained from an independent 
5 structural analysis of free RGS4. Such a model or representation can be generated or 

constructed from the available analytical data points using software packages such as HKL, 
CHARMM, MOSFILM, XDS, CCP4, SHARP, PHASES, HEAVY, XPLOR, TNT, 
NMRCOMPASS, NMRPIPE, DIANA, NMRDRAW, FELIX, VNMR, MADIGRAS, 
QUANTA, BUSTER, SOLVE, O, FRODO, XPLOR, RASMOL, and CHAIN, all of which 
10 are well-known and available to those in the art. A structural representation or model can be 
generated from these data using available systems, including, for example, Silicon Graphics, 
Evans and Sutherland, SUN, Hewlett Packard, Apple Macintosh, DEC, IBM, and Compaq 
systems. The structural representation or model can be displayed or generated in any two- 
O dimensional or three-dimensional form known in the art for viewing, analyzing, modeling or 
J';5 otherwise representing the structure. The structural representation can be transmitted, 
J conveyed or stored in any known graphic, digital or analog form. Structural representations 
D or models generated with the RGS data provided herein can be combined with structural 
m representations of other chemical and biochemical species (e.g., candidate antagonists or 
1_ agonists) including x-ray data of RGS-compIexes, in order to analyze potential interactions 

IfO between RGS, particularly RGS subfamily B proteins, and Ga and those species. The data 
^ provided herein may also be combined, as illustrated herein, with structural information 
O (including x-ray data) of RGS-compIexes, particularly RGS subclass B protein-complexes 

and particularly those complexes believed to be or believed to model biologically functional 
complexes. 

25 The present invention relates to the structural data for free RGS4, the Ga binding site 

of RGS4, the a6 - a 7 region whose conformation changes on binding of RGS4 to Ga, and 
allosteric binding sites in RGS4 in any form (for example in digital, tabular, graphic, or 
pictorial form or as embodied in any representation or model or as embodied in a computer 
storage medium) and the use of the data (in whatever form) for generating a structural 

30 representation or model of free RGS, particularly an RGS subfamily B protein, more 

particularly RGS4, or of the interaction of RGS, an RGS subfamily B protein, and RGS4, 
with any other chemical or biochemical species, including structural representations or 

7 



models of RGS interaction with G-protein submits and of RGS interaction with potential 
agonists or antagonists of RGS function. 

The present invention also provides for a computer system which comprises the 
structural representation or model of the invention and hardware used for construction, 
5 processing and/or visualization of the model of the invention. The solution structural 
coordinates of RGS4 or portions thereof as provided herein can be stored in or on an 
appropriate medium for introduction into or use with any computer program or system for 
generating a representation or model of the structure of an RGS protein, an RGS subclass B 
protein or RGS4, or for analysis of the interaction of RGS with other chemical or biochemcial 
10 species. 

The structural coordinates can be stored in a machine-readable form on a machine- 
readable storage medium, for example, a computer hard drive, diskette, DAT tape, etc., for 
display as a three-dimensional shape or for other uses involving computer-assisted 
r*! manipulation of, or computation based on, the structural coordinates or the three-dimensional 
% structures they define. By way of example, the data defining the three dimensional structure 
JE of RGS4 of the present invention, or of a portion of RGS4 as disclosed herein, may be stored 

0 in a machine-readable storage medium, and may be displayed as a graphical three- 

!C! dimensional representation of the relevant structural coordinates, typically using a computer 

1 capable of reading the data from said storage medium and programmed with instructions for 
creating the representation from such data. 

IIJ Accordingly, the present invention provides a machine, such as a computer, 

J|i programmed in memory with the coordinates of the RGS4 or RGS subfamily B protein, or 
-™" * ... 

^ portions thereof (such as, by way of example, the coordinates of the RSG4 Ga binding site, 

the a6 - a7 region of RGS4, or the allosteric binding site in the al-al region of RGS4), 
25 together with a program capable of converting the coordinates into a three dimensional 

graphical representation of the structural coordinates on a display connected to the machine. 

A machine having a memory containing such data aids in the rational design or selection of 

inhibitors or activators of RGS, Ga or RGS-Ga complex activity, including the evaluation of 

the ability of a particular chemical or biochemical species to favorably associate with RGS, 
30 particularly an RGS subclass B protein, as well as in the modeling of compounds, proteins, 

complexes, etc. related by significant structural or sequence homology to RGS4 or other RGS 

proteins. 
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The present invention is additionally directed to a method of determining the three 
dimensionaLsolution structure of a compound, e.g., a protein or peptide or other chemical or 
biochemical species (including RGS proteins or portions thereof, or more specifically, RGS 
subfamily B proteins or portions thereof that are not RGS4) whose structure is unknown, 
5 comprising the steps of first obtaining a solution of the protein or peptide whose structure is 
unknown, and then generating NMR data from this solution. The NMR data from the protein 
or peptide can then be compared with the known three dimensional structure of RGS4 (or 
portion thereof, e.g., the Ga binding site) as disclosed herein, and the three dimensional 
structure of the protein or peptide whose structure is unknown conformed to the known RGS4 
10 structure using standard techniques, such as 2D, 3D and 4D isotope filtering, editing and 
triple resonance NMR techniques, computer homology modeling as well an adaptation of 
molecular replacement techniques as applied to NMR data. Alternatively, a three 
dimensional model of a protein or peptide of unknown structure, but related by sequence 
O similarity to RGS4, may be generated by initial sequence alignment between RGS4 and the 

y§5 protein or peptide, based on any or all amino acid sequence identity, secondary structure 
f ; elements or tertiary folds, and then generating by computer modeling a three dimensional 

C structure for the molecule using the known three dimensional structure of, and sequence 

If; 

jf I alignment with, RGS4. 

^ Methods are also provided for identifying a species which is an agonist or antagonist 

So of RGS activity, RGS binding to Ga, Ga binding to RGS, or RGS/Ga complex activity, 
r S particularly for RGS subfamily B proteins. The method comprises the steps of using a three 

6 dimensional structure of free RGS subfamily B protein or a portion (e.g., an RGS4 core 

protein) thereof as defined by the relative structural coordinates of amino acids encoding the 
RGS4-core protein to design or select a potential agonist or antagonist, and synthesizing or 
25 otherwise obtaining the potential agonist or antagonist. The potential agonist or antagonist 

may be selected by screening an appropriate database, may be designed de novo by analyzing 
the steric configurations and charge potentials of the RGS4 Ga binding site, the a 6 - a 7 region 
of RGS4, or an allosteric binding site of RGS4 in conjunction with the appropriate software 
programs, or may be designed using characteristics of known agonists or antagonists of 
30 RGS4, RGS subfamily B, or other RGS proteins in order to create "hybrid" agonists or 
antagonists. The method of the present invention is preferably used to design or select 
inhibitors of RGS subfamily B proteins, or RGS subclass B-Ga complex activity, and 
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specifically RGS4 or RGS4-Gial complex activity. In a specific embodiment, the potential 
agonist or antagonist is identified, selected or designed by studying the interaction of 
candidate species with a three-dimensional model of RGS4 (or a portion thereof) or a three- 
dimensional model of another RGS subfamily B protein (or model thereof) and selecting a 
species which is predicted by its interaction with the RGS protein or a portion of an RGS 
protein to act as an agonist or antagonist. Potential antagonists and agonists can be readily 
tested using various procedures disclosed herein or known in the art to confinn their 
antagonist or agonist function. Species identified in accordance with such methods are also 
provided. 

Other specific embodiments provide: (1) a process of identifying a substance that 
inhibits RGS4 activity, RGS4 binding to Giccl, Giccl binding to RGS 4 or RGS4/Gial 
complex activity comprising determining the interaction between a candidate substance and a 
model of all of part of the structure of free RGS4, or (2) a process of identifying a substance 
that mimics or promotes RGS4 activity, RGS4 binding to Giccl, Giccl binding to RGS4 or 
RGS4/G<x complex activity comprising determining the interaction between a candidate 
substance and a model of all or part of the structure of free RGS4 by analyzing the steric 
configuration and charge potential of free RGS4 and comparing these properties to those of a 
candidate substance. Substances identified in accordance with such processes are also 
provided. 

Other embodiments provide a method of identifying antagonists or agonists of RGS 
activity, RGS binding to Ga, Ga binding to RGS or RGS/Ga complex activity by rational drug 
design comprising: (a) designing a potential antagonist or agonist that will form a reversible or 
non-reversible complex with one or more amino acids in the RGS Gcc binding site based upon 
the structure coordinates of free RGS4; (b) synthesizing or otherwise obtaining the antagonist 
or agonist; and (c) determining whether the potential antagonist or agonist inhibits or promotes 
the activity or binding of RGS or the activity of the RGS-Ga complex. In other preferred 
embodiments, the antagonist or agonist is designed to interact with one or more atoms of one or 
more amino acids in the RGS4-Gial binding site. More specifically, the antagonist or agonist 
is designed to interact with amino acids selected from the group consisting of Dl 17, SI 18, or 
R121 of RGS4, other amino acids associated with the Gcc binding site and other amino acids 
revealed by the determined structure. Yet more specifically, the antagonist or agonist is 
designed to interact with amino acids selected from the group consisting of S39, E41, N42, 
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LI 13, D117, SI 18, R121 or N82 of RGS4. Substances identified in accordance with such 
processes are also provided. The agonist or antagonist may form a covalent or non-covalent 
bond with an RGS protein. This method is specifically applicable to identifying antagonists or 
agonists of RGS subfamily B proteins. 

Other specific embodiments provide a method of identifying antagonists or agonists of 
RGS activity or RGS/Ga complex activity by rational drug design comprising: (a) designing a 
potential antagonist or agonist that will form a reversible or non-reversible complex with one or 
more amino acids in a a6 - a 7 region of RGS based upon the structure co-ordinates of free 
RGS4; (b) synthesizing the antagonist or agonist; and (c) determining whether the potential 
antagonist or agonist inhibits or promotes the activity of RGS or RGS/Ga complex. In preferred 
embodiments, the antagonist or agonist is designed to prevent or facilitate conformation change 
in these regions on binding to Ga. This method is specifically applicable to identifying 
antagonists or agonists of RGS subfamily B protein activity or RGS subfamily B/Ga complex 
activity. 

Other specific embodiments provide a method of identifying antagonists or agonists of 
RGS activity or RGS/Ga complex activity by rational drug design comprising: (a) designing a 
potential antagonist or agonist that will form a reversible or non-reversible complex with one or 
more amino acids in an RGS4 allosteric binding site based upon the structure co-ordinates of free 
RGS4; (b) synthesizing the antagonist or agonist; and (c) determining whether the potential 
antagonist or agonist inhibits or promotes the activity of RGS or RGS/Ga complex. In preferred 
embodiments, the antagonist or agonist is designed to interact with the allosteric binding site in 
the a r a 2 region of RGS4. In yet other preferred embodiments^the antagonist or agonist is 
designed to interact with one or more atoms of one or more amino acids in the allosteric binding 
site in the a x and a 2 region of RGS4, and particularly with one or more atoms of amino acids 
VI 0, W13, L17, L20 ? H23, E24, C25, or T132 of RGS4. Substances identified in accordance 
with such processes are also provided. This method is specifically applicable to identifying 
antagonists or agonists of RGS subclass B protein activity or RGS subclass B/Ga complex 
activity. 

Candidate agonists and antagonists of RGS T RGS-Ga complexes can be selected from 
any type of small molecule, dimer, multimer, oligomer, or polymer of natural or non-natural 
origin that is obtained from any source and may be isolated from a natural source or chemically 
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or biologically synthesized. Candidate antagonists and agonists can include nucleic acids, 
peptides, polypeptides, proteins, and various small organic molecules. 

The study of the interaction of the candidate species with the three-dimensional 
structure of RGS and/or portions of that structure can be performed using available software 
5 platforms, including QUANTA, RASMOL, O, CHAIN, FRODO, INSIGHT, DOCK, 
MCSS/HOOK, CHARMM, LEAPFROG, CAVEAT(UC Berkley), CAVEAT(MSI), 
MODELLER, CATALYST, and ISIS. 

The invention also provides a method for identifying the presence of and determining 
the location of allosteric binding sites in RGS4. The method comprises the steps of 
10 contacting free RGS4-core in solution with test compounds that are members of a library of 
chemical species which encompass a range of structural features or which are known to 
inhibit RGS function; measuring the ! H, i5 N, and/or l3 C NMR spectra of the RGS4-core in 
the presence of test compounds of the library to detect any perturbations in the chemical 
^ shifts of RGS4-core that are induced by binding of a test compound to RGS4-core, and 
%5 determining if binding of the test compound affects RGS activity. This can be done, for 
m \ example, by assessing the affect of the test compound on RGS induced Ga GTPase activity. 

! = ;! The amino acid residues of RGS4-core that are affected by binding of the test compound 
If I define the binding site of the test compound. If the test compound is found to affect RGS4- 

«i core activity and the location to which the test compound binds in RGS4-core is not the Ga 

HO binding site, then the location to which the test compound binds is an allosteric binding site. 

One such allosteric binding site in the al-al region of RGS4-core has been identified using 
fi this method. 

The three-dimensional structure of any allosteric binding site identified by this 
method can then be employed in methods described herein to identify, select and design 
25 candidate agonists and antagonists of RGS activity and specifically of RGS subclass B 
activity. Test compounds for assessing the presence of allosteric sites in RGS can be 
members of a library that exhibit a range of structural feature (e.g., alicyclic rings, 
heterocyclic rings, aromatic rings, aliphatic, alicyclic compounds or aromatic compounds 
displaying various substituent groups (e.g., OH, -CO-,- NHCO-, etc.). Test compounds can 
30 also be selected in screens for compounds that are known to exhibit an affect on RGS activity 
(e.g., that enhance or retard the rate of RGS4-induced Ga GTPase). Initial screens can be 
performed by assessing mixtures containing a plurality of test compounds for an affect on 
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RGS activity. In cases in which an affect Is observed with the mixture of test compounds, the 
individual compounds can be re-tested individually to determine which test compound(s) 
affect RGS activity. 

In a specific embodiment, the invention provides a method in which the three 
dimensional structure of free RGS4~core is employed to identify chemical or biochemical 
species or fragments thereof capable of binding to an RGS protein. Once identified the 
species or fragments capable of binding to RGS are assembled (using well-known computer 
modeling techniques) into a single molecule to provide a structure of a potential antagonist or 
agonist. The molecule assembled can contain additional species or fragments (e.g., a 
backbone) for desired orientation of the species or fragments capable of binding to RGS. 
This method is particularly applicable to RGS subfamily B proteins. 

The invention further provides a method for identifying mutants of RGS4 proteins in 
which the activity of the mutant protein is different from that of RGS4. In this method the 
three-dimensional structure of free RGS4 is employed to identify amino acids that are 
involved in the regulation of G-protein signaling. One or more of the amino acid residues 
identified are then modified to generate a mutant RGS4. Mutants identified in this method 
are expected to exhibit altered function in the regulation of G-protein signaling. 

Other objects of the invention will be readily apparent from the following detailed 
description. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 illustrates the secondary structure of RGS4. The figure provides a summary 
of the sequential and medium range NOEs involving the NH ? Ha and H[J protons, the amide 
exchange and 3 JHNa coupling constant data, and the 13 Ca and 13 Cp secondary chemical 
shifts observed for RGS4 with the secondary structure deduced from this data. The thickness 
of the lines reflects the strength of the NOEs. Amide protons still present after exchange to 
D2O are indicated by closed circles. The open boxes represent potential sequential 
assignments NOEs which are obscured by resonance overlap and could therefore notbe 
assigned unambiguously. The gray boxes on the same line as the Ha(i)-NH(i+l) NOEs 
represents the sequential NOE between the Ha proton of residue i and the C8H proton of the 
1+1 proline and is indicative of a trans proline. Seven alpha helical regions are indicated (al- 
«7). 
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Figures 2 A and 2B are ribbon diagrams of the (A) x-ray structure of RGS4 from the 
RGS4-Gial complex, (B) NMR structure of free RGS4 for residues V5 to PI 34. The residues 
which exhibit a significant structural change between the RGS4-Gial x-ray structure and the 
free~RGS4 NMR structure are numbered. Residues Kl 16-Y1 19 correspond to key residues 
involved in the interaction with Gicd and the location of a structural change between the free 
and complexed forms of RGS are indicated. The C- and N-terminal regions which exhibit a 
change in secondary structure and helical packing are also indicated. The RGS4-Giccl x-ray 
structure is that of Tesmer et al. (8). The C- and N-terminal regions which incur a change in 
secondary structure and helical packing are indicated. The observed helical regions of the 
RGS4 structure are labeled. 

DETAILED DESCRIPTION OF THE INVENTION 
RGS proteins are regulators of G-protein signaling which affect the intensity and 
duration of the G-protein signal cascade by binding to the active Gct-GTP-Mg 2 complex to 
increase the' rate of GTP hydrolysis. RGS proteins act as attenuators of the induced G-protein 
signal by increasing the rate of inactivation of the G-protein and termination of the signal. 
RGS proteins have been identified in a wide range of eukaryotes, including humans. RGS 
proteins are highly diverse, multifunctional proteins characterized by the presence of a core 
region of approximately 130 amino acid residues (sometimes identified as having 120 amino 
acids), which may be separated by linker regions of varying lengths (79, 80, 9), that is 
conserved in all RGS proteins that have so far been identified. All RGS proteins that have 
been identified bind to members of the Gia class of G protein a subunits. The family of RGS 
proteins include RGS4, GAIP (human Gcc-interacting protein), RGS 1, RGS 10, RGS1 1, 
RGS 12, RGS 13, RGS 14, and RGS 16 (also called RGSr), Axin, Conductin, pi 15-RhoGEF, 
PD2-RhoGEF and LSC (86), among others, and contains more than 20 known members 
where specificity for Goc subtypes has been demonstrated and appears to be associated with 
subtle sequence differences (8, 14). RGS4 is believed to stabilize the transition state for GTP 
hydrolysis (17, 57, 21). The conserved region of RGS provides for binding to Gcc and can 
thus be used to identify species that affect (as agonists or antagonists) RGS binding to Ga and 
the activity of RGS as an attenuator of G-protein signaling. RGS proteins of this invention 
function in G-protein regulation by binding to the Ga subunit of a G-protein. RGS proteins 
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may, but need not, exhibit other biological functions. References 89 and 91 provided reviews 
of additional- biological functions exhibited by RGS proteins. 

The term "RGS protein" as used herein, including its use for specific RGS proteins 
and RGS protein subfamilies, includes native RGS proteins (and native RGS core proteins) 
isolated from or otherwise obtained from (e.g.^-by expression of cloned genes) from any 
natural sources, recombinant RGS proteins which may contain portions of RGS sequence and 
non-RGS sequence (e.g., RGS-core sequence with the hexahis pro-tag), variant RGS proteins 
which contain conservative amino acid sequence differences from a native RGS protein or in 
which sequences non-functional in RGS activity are deleted, mutant RGS proteins in which 
one or more amino acids have been modified by expression from a mutant RGS coding 
sequence. Mutants include, among others, those having one or more site specific mutations, 
those having one or more deletions and those having one or more insertions compared to a 
native RGS protein (or RGS-core) or variant RGS (or variant RGS-core), The term mutant 
RGS refers in particular to those proteins having the described mutations, insertions or 
deletions in the RGS core region. Variant RGS proteins are expected to have biological 
function for G-protein regulation substantially the same as that of the native RGS protein 
from which they are derived. Mutant RGS proteins include those which have biological 
function substantially the same as or modified from that of a native or variant RGS protein 
from which they are derived. Variant, derivative, recombinant and mutant RGS proteins do 
not necessarily represent mutually exclusive subsets of proteins. 

As noted herein, the RGS-core region is involved with RGS function in G-protein 
regulation, the term RGS proteins as used herein include RGS proteins in which non- 
functional regions are absent, e.g., RGS-core regions of native, recombinant, variant or 
mutant RGS proteins. The RGS core region of a native RGS has been found to retain full 
native RGS activity (8). The core region of RGS4 is approximately 130 amino acids in 
length. (References may also refer to conserved or core regions of RGS as having a length of 
approximately 120 amino acid) RGS cores from other RGS proteins can differ in length from 
that of RGS4. RGS proteins of this invention can be obtained by in vitro or in vivo ^ 
expression of an RGS coding sequence by isolation from natural sources or any other means 
known in the art. 

Known RGS proteins are categorized into six or seven subfamilies on the basis of a 
phylo-genetic analysis of 61 mammalian and invertebrate RGS proteins (86). Mammalian 
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RGS proteins are composed of at least six subfamilies designated A-F as follows: A (GAIP, 
Ret-RGS 1, RGS21); B (RGSI, RGS2, RGS3, RGS4, RGS 5, RGS8, RGS 13 and RGS 16 [also 
called RGS-r]; C (RGS6, RGS7, RGS9 and RGS1 1); D (RGS 12, RGS 14); E (Axin and 
Conducting and F (pi 15-RhoGEF, PD2-RhoGEF and Lsc). Two other RGS proteins TGS10 
5 and D-AKAP2 are structurally diverse from those of subfamilies A-F and may represent a 
separate subfamily. Subfamilies B, C and D all have characteristic residue Asn (N82 in 
RGS4 as numbered herein, or NI28 as numbered in Tesmer et ah [8]), which is associated 
with Ga binding at least in RGS subfamily B proteins. RGS proteins of subfamily A are 
substituted at this position in the RGS core with a serine (SI 56 in GAIP). Additionally, the B 
10 subfamily of RGS proteins is reported to have another characteristic residue, a serine (at 

position 57 in RGS4 as numbered herein and SI 03 as numbered in Tesmer et aL [8]). RGS4 
represents the B subfamily of RGS proteins and is structurally more similar to and believed to 
have biological activity and function more similar to other members of the B subfamily, 
Jg including RGS1, RGS2, RGS3, RGS5, RGS 8, RGS 13 and RGS 16. GAIP, for example, is 
1p> representative of the A subfamily of RGS proteins and is structurally more similar to and 
fU believed to have biological activity and function more similar to other members of the A 

subfamily including Ret-RGS 1 and RGSZ1 . Thus, the term RGS subfamily B refers to 
m RGS1 , RGS2 > RGS3, RGS4, RGS5, RGS8, RGS 13, RGS 16 and other, as yet 
□ uncharacterized, RGS proteins that exhibit structural features characteristic of the B 

SO subfamily and which are classifiable into the B subfamily by phylogenetic analysis as 
D described in Zheng, B. et al. (1999) supra . Analogously, the term RGS subfamily A refers to 

I[ GAIP, Ret-RGS 1 and RGS2 1 and other RGS proteins as yet uncharacterized that exhibit 

structural characteristics of the A subfamily and which are classified as RGS subfamily A 
proteins by phylogenetic analysis. RGS subfamilies OF have analogous definitions. 
25 The term RGS4 refers to RGS4 exemplified by RGS4 of rat (Tesmer et al (1997) 

supra) and homologs thereof including, among others, human RGS4 and mouse RGS4. The 
RGS core region of human, rat and mouse RGS4 differ from one another by 2-4 amino acids 
(representing about 97% or more sequence identify in the 130 amino acid core). Honiologs 
of GAIP can exhibit as low as about 85% sequence identity in the RGS core region. An 
30 RGS4 homolog may, thus, exhibit RGS4 core sequence identify as low as about 85% with rat 
RGS4. 
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RGS protein NMR studies and structural determinations herein were performed using 
an RGS4 -core protein consisting of the conserved region of RGS4 (specifically that -derived 
from rat) with a N-terminal methionine and a C-terminal hexahistidine tail. The three- 
dimensional solution structure determined for the RGS4-core protein, assuming the 
possibility of conservative amino acids changes and within ± a root mean square deviation of 
the relative structural coordinates of the backbone atoms listed in Table 2 of not more than 

a o o 

1.5 A (or more preferably, not more than t.OA, or most preferably, not more than 0.5 A), 
model the three-dimensional solution structures of other RGS4 proteins of any eukaryotic 
origin, including human RGS4. Further, because of the significant conservation of this 
domain among different RGS proteins, the three-dimensional structure of RGS4-core 
provided herein, again assuming conservative amino acids changes, and within ± a root mean 
square deviation of the relative structural coordinates of the backbone atoms of the structure 
of not more than L5A (or more preferably, not more than 1.0A, or most preferably, not more 
than 0.5 A), models the structures of the conserved region in other RGS proteins of all origins. 

"Structural coordinates" are the Cartesian coordinates corresponding to an atom's 
spatial relationship to other atoms in a molecule or molecular complex. Structural 
coordinates may be obtained using NMR techniques, as described herein or as known in the 
art, or using x-ray crystallography as is known in the art. Alternatively, structural coordinates 
can be derived using molecular replacement analysis or homology modeling. Various 
software programs allow for the graphical representation of a set of structural coordinates to 
obtain a three dimensional representation of a molecule or molecular complex. The structural 
coordinates of the present invention may be modified from the original sets provided in Table 
2 by mathematical manipulation, such as by inversion or integer additions or subtractions. As 
such, it is recognized that the structural coordinates of the present invention are relative, and 
are in no way specifically limited by the actual x, y, z coordinates of Table 2. The structural 
coordinates of Table 2 ± a root mean square deviation from the conserved backbone atoms of 
the amino acids therein(or conservative substitutions thereof) of not more than 1.5 A (or more 
preferably, not more than 1.0A, or most preferably, not more than 0.5A) define or embody the 
three-dimensional structure of free RGS4 (i.e., not complexed with another molecule) in 
solution. The RGS4 core conserved region contains a Ga binding site and an allosteric 
binding site. Amino acid sequences can be inserted between the helical regions of the RGS 
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core region without significantly altering the biological function of the RGS protein. RGS 
proteins of lower eukaryotes contain such insertions. 

"Root mean square deviation" is the square root of the arithmetic mean of the squares 
of the deviations from the mean, and is a way of expressing deviation or variation from the 
structural coordinates described herein. 

As used herein, "RGS activity/* "activity of RGS" and other similar terms refer to the 
ability of RGS to bind to an active Ga-GTP-Mg 2 complex and induce a change in the rate of 
GTP hydrolysis. Any other biological function or activity of an individual RGS protein will 
be specifically defined herein. References 89 and 91 are incorporated by reference herein for 
their review of the additional biological functions of certain RGS proteins. Any assay which 
measures the rate of GTP hydrolysis in a Goc-GTP-Mg 2 complex in the presence and absence 
of RGS (or portions thereof) can be used to measure such activity. A preferred assay method 
measures precipitated radiolabeled phosphate that results from hydrolysis of Ga-[y- 32 P]- 
GTP-Mg 2 as described in the Examples herein. 

Table 2 lists the atomic structure coordinates for the restrained minimized mean 
structure of free RSG4 as derived by NMR spectroscopy. The first two columns in Table 2 
list atom number, the third column identifies the atom type using standard nomenclature, the 
fourth and fifth columns list the amino acid and its number in the sequence. The sixth, 
seventh and eighth columns of the table are relative coordinate values (in three dimensions). 

It will be obvious to the skilled practitioner that the numbering of the amino acid 
residues in the RGS4 and other RSG proteins covered by the present invention may be 
different than that set forth herein. The RGS4 core protein used herein contains an RGS core 
domain with an N-terminal Met and a six residue histidine tag at the C-terminus. In Fig. 1 
the amino acid sequence of the RGS4 core protein used is numbered beginning at the N- 
terminal Met. For comparison to the full-length RGS4 sequence (for example, as numbered 
in Tesmer et al. (1997) (8)) add 46 to the numbering used herein. 

It will also be obvious to the skilled practitioner that RGS proteins and portions 
thereof covered by this invention may contain certain conservative amino acid substitutions 
that yield the same three dimensional structures as those defined by the structural coordinates 
provided herein ± a root mean square deviation from the conserved backbone atoms of the 
amino acids therein(or conservative substitutions thereof) of not more than 1.5A. Ammo 
acids in other RGS proteins or peptides corresponding to those in RGS4 and conservative 
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substitutions in other RGS proteins or peptides are readily identified by visual inspection of 
theTelevant-amino acid sequences or by using commercially available homology software 
programs. "Conservative substitutions" are those amino acid substitutions which are 
functionally equivalent to the substituted amino acid residue, either by way of having similar 
polarity, steric arrangement, or by belonging to the same class as the substituted residue (e.g., 
hydrophobic, acidic or basic), and includes substitutions having an inconsequential effect on 
the three dimensional structure of RGS with respect to the use of said structure for the 
identification and design of RGS antagonists or agonists, and for molecular replacement 
analyses and/or for homology modeling. 

The structural coordinates of the present invention permit the use of various molecular 
design and analysis techniques in order to (t) solve the three dimensional structures of related 
RGS proteins, peptides or complexes thereof, and particularly RGS subfamily B proteins, 
peptides or complexes thereof and (ii) to select, design and synthesize or otherwise obtain 
chemical and biochemical species capable of associating, binding or interacting with RGS 
potentially having function as antagonists or agonists of an RGS, Gcc or an RGS-Ga complex. 

Molecular replacement analysis is a well-known technique employed in x-ray 
crystallography which uses the x-ray structure of a molecule having as a starting point to 
model a molecule whose crystal structure is unknown. This technique is based on the 
principle that two molecules which have similar structures, orientations and positions will 
diffract x-rays similarly. A corresponding approach to molecular replacement is applicable to 
modeling an unknown solution structure using NMR technology. The NMR spectra and 
resulting analysis of the NMR data for two similar structures will be essentially identical for 
regions of the molecules that are structurally conserved, where the NMR analysis consists of 
obtaining the NMR resonance assignments and the structural constraint assignments, which 
may contain hydrogen bond, distance, dihedral angle, coupling constant, chemical shift and 
dipolar coupling constant constraints. Appropriate NMR spectra are accumulated for a 
solution of the species of unknown structure and compared to NMR of the species of known 
structure. The observed differences in the NMR spectra of the two structures will highlight 
the differences (and similarities) between the two structures and identify the corresponding 
differences in the structural constraints. The structure determination process for the unknown 
structure is then based on modifying the NMR constraints from the known structure to be 
consistent with the observed spectral differences. This method is applicable to the 
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determination of three-dimensional solution structures of any RGS protein or peptide using 
the structural information for RGS4 provided herein. The method is most appropriate for 
determining the structures of RGS proteins that are expected to have significant structural 
similarity with RGS4. For example, this invention specifically provides the three- 
dimensional structure of a rat RGS4-core region in solution. The replacement method 
described above can be employed to determine the three-dimensional structure of the human 
RGS4-core which differs from that of rat by 2 amino acids in the 130 RGS core (at positions 
22 N (rat) >S (human), and 132 T(rat) > V(human), referring to the rat sequence given in Fig. 
1. 

Accordingly, in one nonlimiting embodiment of the invention, the NMR resonance 
assignments for RGS4 provide the starting point for resonance assignments of other RGS 
family proteins (or portions thereof), that are expected to be structurally similar to RGS4, 
e.g., RGS4 homologs from different organisms or more generally RGS proteins of the 
subfamily B. Chemical shift perturbations can be detected using one or two dimensional 
spectra (e.g.-, ^N/ 1 !!, i3 C/ ! H spectra) or using other methods well known in the art and 
compared between RGS4 and another RGS protein. In this way, the affected residues may be 
correlated with the three dimensional structure of RGS4 as provided by the relevant residues 
of Table 2. This effectively identifies the region of the other RGS protein or peptide that has 
a structural change relative to the RGS4 protein. The J H, 15 N, l3 C and n CO NMR resonance 
assignments corresponding to both the sequential backbone and side-chain amino acid 
assignments of the other RGS protein, or portion thereof, can then be obtained and the three 
dimensional structure of this protein, or portion thereof, can be generated using standard 2D, 
3D and 4D triple resonance NMR techniques and NMR assignment methodology, using the 
RGS4 structure, resonance assignments and structural constraints as a reference. Various 
computer fitting analyses of the other RGS protein or peptide with the three dimensional 
model of RGS4 can be performed in order to generate an initial three dimensional model of 
the other RGS protein or peptide, and the resulting three dimensional model may be refined 
using standard experimental constraints and energy minimization techniques in order to 
position and orient the other RGS in association with the three dimensional structure of 
RGS4. % 

The present invention further provides that the structural coordinates of the present 
invention can be used with standard homology modeling techniques in order to determine the 
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unknown three-dimensional structure of an RGS protein or portion thereof. Homology 
modeling, as is known in the art, involves constructing a model of an unknown structure 
usipg structural coordinates of one or more related protein molecules, molecular complexes 
or parts thereof (i.e., active sites). Homology modeling may be conducted by fitting common 
5 or homologous portions of the protein whose three dimensional structure is to be solved to 
the three dimensional structure of homologous structural elements in the molecule of known 
three-dimensional structure, specifically using the relevant (i.e., homologous) structural 
coordinates provided by Table 2. Homology can be determined a variety of known methods, 
for example, using amino acid sequence identity, homologous secondary structure elements, 
10 and/or homologous tertiary folds. Tesmer et aL (1997) (8) and Druey and Kehrl (1997) (88) 
provide examples of multiple sequence alignments of RSG protein sequences. Homology 
modeling can include rebuilding part or all of a three dimensional structure with replacement 
of amino acids (or other components) by those of the related structure to be solved. 
2s Molecular replacement analysis as adapted and applied to NMR structural data (as discussed 

135 above) and homology modeling are techniques that are well known in the art which can be 
readily applied or adapted to the determination of the three dimensional structures of other 
m proteins of the RGS family (and portions thereof e.g., Ga binding sites and/or allosteric 

^ ;! binding sites). These methods are particularly useful for determining RGS solution structure 

Is 

13 within the conserved region of the protein based on the RGS4 three-dimensional solution 

JKO structure. These methods are applicable to presently known members of the RGS family of 
w proteins as well as to proteins, particularly those of RGS sub-family B, as yet unidentified as 

JT RGS proteins, and particularly those that exhibit significant sequence identity above 60% or 

more, preferably 85% or more sequence identity in the RGS-core region. NMR assignments, 
structural coordinates and three-dimensional structures of RGS proteins or peptides, 
25 determined using molecular replacement analysis and homology modeling based on the 

structural coordinated and NMR assignments provided herein and optionally refined using a 
number of techniques well known in the art, can be employed in a similar fashion to the 
structural coordinates of Table 2 for identifying, selecting or designing chemical species that 
are antagonists or agonists of RGS, Ga or RGS Ga complexes. 
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Description of the Structure of RGS4 

The primary amino acid sequence of several RGS4 proteins are known. The-amino 
acid sequence of RGS4-core (from rat) with attached hexahis pro tail is listed in Fig. 1 (as 
SEQ ID No. 1). The regular secondary structure elements of free RGS4 were identified from 
5 a qualitative analysis of sequential and inter-strand NOEs, NH exchange rates, z J ma coupling 
constants and the I3 Ccc and t3 Cp secondary chemical shifts (47, 48). The sequential and 
medium NOEs were obtained from a qualitative analysis of the l5 N-edited NOESY and n C- 
edited NOESY spectra. 3 coupling constants were obtained from the relative intensity of 
Ha crosspeaks to the NH diagonal in the HNHA experiment (18). Slowly exchanging NH 
10 protons were identified by recording an HSQC spectra two hours after exchanging an RGS4 
sample from H2O to D2O. These data, together with the deduced secondary structure 
elements are summarized in Fig. 1 . 

The overall structure of RGS4 is composed of seven helical regions corresponding to 
f residues 7-12 (a,); 17-36 (a 2 ); 40-53 (a 3 ); 61-71 (a 4 ); 86-95 (a 5 ); 105-125 (a 6 ) and 128-132 

=115 (a 7 ). A simple description of the RGS4 topology is that the protein consists of two pseudo 4- 
}i= helix bundles with an up-down-up-down arrangement where helical region six is part of both 

OH bundles. An unusual feature of the RGS4 structure occurs in the second helical region. There 

5 is a one residue (H23) -90° bend in the helix which effectively divides this helical region 

5! into two separate helices (as described in the RGS4 x-ray structure (5)). This one residue 

PUZO bend was not obvious from the NMR analysis of the secondary structure data (Fig. 1) where it 

i. if 5 ] 

appears to be a continual helical stretch- The bend only became apparent during the structure 
N : refinement process. Some observable NOEs that contribute to the>bend at H23 occur between 

residues -L20, 121, and residues G26, L27 ? A29, and F30. The bend at H23 effectively allows 
for appropriate packing of these hydrophobic side-chains. 

25 Additional bends or turns occur throughout the RGS4 structure. Helical regions 0^ 

and a 2 are connected by residue SI 6 that adopts an extended conformation allowing these two 
helices to be essentially parallel. This is very similar to the turns connecting helical regions 
<x 3 and a 4 and helical regions a 5 and a 6 . Conversely, helical regions a 2 and a 3 are connected 
by Y38 that has a positive <}> torsion angle, suggesting a P type turn. The conformation of 

30 Y38 results in an angle between helical regions 0^ and a 3 of -45°, which also represents a 
transition point between the two pseudo 4-helix bundles. The longest loop in the structure 
occurs between helical regions <x 4 and a 5 . This loop region is well ordered based on high 
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order parameters (S 2 > 0.6). The low mobility for this loop results from interactions with 
helical regions cc 3 and a 6 . The observed bend between the longest helical region ct 6 and the 
shortest helical region ct 7 is suggestive of a distortion in this helical segment to achieve an 
optimal packing interaction between helical regions a l and ce 7 . The end result of these local 
conformations on the overall topology of RGS4 is to create an elongated structure where the 
two pseudo 4-helix bundles are nearly perpendicular. The interface between these the two 
pseudo 4-helix bundles is predominately hydrophobic in nature (L17, 121, L27, F30, L34, 
W46, 147, II 10, Fl 1 1, LI 13, Ml 14) consistent with the general packing of hydrophobic 
residues in the core of the protein with charged residues on the protein surface. 

As previously described, the primary biological function for RGS4 is to bind G ial and 
stimulate its intrinsic GTPase activity. Key residues in the RGS4 structure that are involved 
in the interaction of RGS4 with G ial correspond to RGS4 residues S39, E41, N42, LI 13, 
D117, S118,andR121 that form the binding pocket for T 182 from G icl . Similarly, N82 
from RGS4 binds into the G ic! active site interacting with residues Q204, S206 and E207 (8) 
of G jct . RGS4 mutational work support the functional importance of these residues in the 
binding and activity of RGS4 with G ittl while identifying N82 to be critical in facilitating 
GTP hydrolysis (18-20). RGS4 residues S39, E41 and N42 are located in the N-terminal end 
of helical region cc 3 while LI 13, Dl 17, SI 18, and R121 are located directly opposite at the C- 
terminal end of helical region a 6 , N82 is located approximately in the center of the structured 
loop region between helical regions oc 4 and ce 5 which is positioned relatively above the T182 
binding pocket on RGS4. 

Another feature of the RGS4 structure is the observation that residues M1-S4 and 
P134-H166 are completely disordered and dynamically flexible. Structure coordinates for 
these atoms are not included in Table 2. This is evident by the sharp line-widths and the 
minimal number of observable NOEs. The flexible nature of these residues are further 
supported by I5 N Tl, T2 and NOE measurements which indicate low order-parameters (S 2 < 
0.6) 

RGS4 Structure Determination 

The final 30 simulated annealing structures were calculated on the basis of 2871 
experimental NMR restraints consisting of I960 approximate interproton distance restraints, 
78 distance restraints for 39 backbone hydrogen bonds, 431 torsion angle restraints comprised 
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of 151 <j>, 154 97 xi, 29 X2 torsion angle restraints, 132 3 J NHo restraints and 136 Ca and 
134 Cp chemical shift restraints. Stereospecific assignments were obtained for 58 of the 125 
residues with p-m ethylene protons, for the methyl groups of 3 of the 5 Val residues, and for 
the methyl groups of 9 of the 12 Leu residues. In addition, 7 out of the 8 Phe residues and 4 
out of the 5 Tyr residues were well defined making it possible to assign NOE restraints to 
only one of the pair of CSH and CsH protons and to assign a X2 torsion angle restraint 
Comparison of the Free RGS4 NMR Structure with the RGS4 G iai Bound Structure 

Figs. 2A and 2B are ribbon diagrams of (A) the x-ray structure of RGS4 complexed to 
Gial (8) and (B) the solution structure of RGS4 as determined by NMR methods. Residues 
that effect significant structural change between the two structures are indicated. An 
unexpected result from determining the solution structure of RGS4 in the absence of G ictl was 
the observation of a significant change in the conformation for free RGS4 relative to RGS4 in 
the complex (J). A fundamental factor in the difference between the two structures is a 
perturbation in the secondary "structure elements. Consistent with the RGS4-G ial x-ray 
structure, the NMR structure of free RGS4 is an a-helical protein comprised of two peudo 4- 
helix bundles. The NMR data shows that free RGS4 is composed of seven helical regions and 
a majority of this data is consistent with the RGS4-G icd x-ray structure. The significant 
difference between the two secondary structures occurs within the C-terminal helical regions 
a 6 and oc 7 . In the RGS4-G ia] x-ray structure residues V5 to T132 are observed in (i.e., they 
are ordered) and residues 104-1 16 and 1 19-129 are helical. This contrasts with the free RGS4 
NMR structure where residues 5-133, 105-125 (a 6 ) and 128-132 (a 7 ) are helical. There is a 
significant shift in the helical structure in this region containing residues 104-133. 

The observed structural change between the free RGS4 NMR structure and the RGS4- 
G [a { x-ray structure is a movement of a kink between helical regions a 6 and cc 7 towards the C- 
terminus. The movement of this kink results in ct 6 being longer by nine residues and a 7 being 
shorter by six residues in the free RGS4 NMR structure. Additionally, cc 7 of free RGS4 
extends three residues beyond what was observed as a structural region in the RGS4-G iai x- 
ray structure. 

The observed change in the secondary structure, although only involving a few C- 
terminal residues, has far-reaching effect, since it results in a significant modification in the 
overall fold for RGS4. This is evident from a 1.94 A backbone rms difference between the 
RGS4-G ial x-ray structure and the free RGS4 NMR structure for residues 5-134. The major 
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effect of the alteration in secondary structures is a reorganization of the packing of the N- 
terminal and~C-terminal helix as is evident from per-residue backbone atomic rms differences 
between the free RGS4 NMR structure and the bound RGS4 x-ray structure. Therefore, the 
accuracy of the secondary structure interpretation is important for proper analysis of the free 
RGS4 structure. The reliability of the NMR secondary structure is demonstrated by the 
extensive data summarized in Fig. 1 . Residues 105-125 and 128-132 show a continual stretch 
of NMR data consistent with an ot-helical definition with an abrupt break in this information 
for residues 125-128. Furthermore, the significant differences between the - and C-terminal 
regions of the free RGS4 NMR structure and the bound RGS4 x-ray structure are indicated by 
a large number of interproton distance (145) and torsion angle (39) violations and by the 
corresponding very high values for the NOE and torsion angle restraint energies exhibited by 
the bound RGS4 x-ray structure. The self-consistency of the NMR data using NOEs, 
coupling constants, NH exchange rates and secondary carbon chemical shifts and the large 
number of restraint violations with the bound structure, demonstrate the accuracy of the 
RGS4 NMR" structure provided. Comparisons of the free RGS4 structure of this invention 
with the structure of the RGS4-Gial complex should then provide an accurate description of 
the conformational changes that occur in on RGS4 on binding to Gial. 
Relevance to Activity for the RGS4 Conformational Change 

RGS4 is involved in the regulation of the G iai GTPase cycle having a modest affinity 
for GTP-Ga, but not binding to GDP-Ga. It is believed that the observed conformational 
changes for free RGS4 are related to modulating its affinity to G ial to allow for perpetuation 
of the GTPase cycle. This role for the RGS4 conformational change is evident by the fact 
that the RGS4 Get binding site is the location of the G ial induced structural perturbation. The 
pronounced kink between helical regions a 6 and a 7 observed in the bound RGS4 x-ray 
structure occurs at residues D 1 1 7 and S 1 1 8. RGS4 molecular surfaces for both the free 
RGS4 NMR structure and the RGS4-G faJ x-ray structure in the vicinity of the G ial T182 
binding pocket were calculated. A comparison of the two RGS4 molecular surfaces, shows 
that the G ial T182 binding pocket is larger and more accessible in the free RGS4 NMR 
structure. Also, in the RGS4-G iai x-ray structure there appears to be a molecular surface 
"wall" composed of the RGS4 sidechains from residues Dl 17, SI 18 and R121 which 
surround the G ia , T182 binding pocket. These residues form an important hydrogen-bonding 
network which is critical for the binding of RGS4 with G ial where Dl 17 forms a hydrogen 
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bond with R121 and the backbone nitrogen of G ial T182. The critical nature of these residues 
is fkrther supported by mutagenesis. Alanine mutations of Dl 17 and R121 diminishes RGS4 
activity and binding to G iaI . Since the helical kink at residues Dl 17 and SI 18 is less 
pronounced in the free RGS4 NMR structure and a disruption in the helix occurs instead 
between residues 125-128, the sidechains for Dl 17, Si 18 and R121 are well beyond 
hydrogen-bonding distance. It is evident from the free RGS4 NMR structure that the network 
of sidechain interactions with G iat T182 in the absence of G iai is not pre-formed. 

The observation that RGS4 undergoes a significant structural change in the presence 
of G ictl where the focal point of this change occurs at key residues in the RGS4- G ial interface 
creates a different explanation for the process of RGS4 activation of G iotl GTPase activity. 
This information suggests a two-stage process composed of a binding and locking step. 
Because the G ictl T182 binding pocket is clearly more accessible in the free RGS4 NMR 
structure, the binding step appears to be driven by the fit of T 182 into this pocket. The 
locking step then results from the induced conformational change in the RGS4 structure 
where the pronounced kink in the helix between residues Dl 17 and SI 18 brings these 
residues into close contact with R121 and G ial T182 to form the hydrogen bonding network 
observed in the RGS4-G ial x-ray structure. Tl 82 in the binding pocket induces the formation 
of a hydrogen bonding network and the resulting RGS4 conformational change as opposed to 
a pre-formed binding site suggested from the RGS4-G iaI x-ray structure. The release of RGS4 
from G ial would then require the removal of G taI T182 from the RGS4 binding pocket which 
presumably occurs during GTP hydrolysis. This mechanism is consistent with a local 
perturbation in the vicinity of T 182 seen between the GDP- C ial (2) and RGS4- G ial x-ray 
structures where this localized movement appears to be sufficient to remove T182 from the 
RGS4 binding site and disrupt the hydrogen-bonding network resulting in dissociation of the 
complex. Comparison of the Tl 82 G iaI region between the RGS4- G ia! and the GDP-AIF 4 - 
G iai x-ray structures (4) indicate that these two structures are essentially identical in this 
region of G ial . Since the GDP-A1F 4 -G ici f structure corresponds to the active form of GDP- 
A1F 4 -G fal as well as the conformation that RGS4 preferentially binds, the similarity between 
these two structures is also consistent with the proposed mechanism for the activity of RGS4. 

The x-ray structure of RGS4 complexed with G ial in conjunction with other Ga 
conformers suggest that the role of RGS4 in stimulating Ga GTPase activity is accomplished 
by stabilizing the GTP hydrolysis transition state. The NMR structure of free RGS4 reported 



here expands this mechanism suggesting that the RGS4 induced conformation in the presence 
of G iaI may he related to its GTP-G iaL specificity which facilitates binding turnover that is 
critical for perpetuating the GTPase cycle. The described structural change in RGS4 provides 
an elegant mechanism for the observed binding selectivity between the various Ga 
conformers despite the close similarity in these structures. 
Detection of an Alios teric binding site in RGS4 

Several small molecule inhibitors of the RGS4-Ga interaction were identified in a 
large scale screening based on detection of inhibition of Ga GTPase function which implies 
inhibition of the binding of RGS4 to Ga. One of these compounds (designated compound 1 
for convenience) exhibited 100% inhibition of binding. The nature of the activity of 
compound 1 and its ability to inhibit RGS4 binding to Ga was further investigated by *H- I5 N 
HSQC chemical shift perturbation experiments. A total of five compounds, three that had 
exhibited inhibition of RGS4 binding in the screen and two controls that showed no activity 
in the screen were examined. 2D l H- 15 N HSQC spectra were collected for a 15 N-enriched 
RGS4 sample and a series of 15 N-enriched RGS4 samples titrated with one of the three test 
compounds and two controls. Comparison of the HSQC spectra of a free RGS4 sample and 
each of the samples titrated with a potential inhibitor allowed the identification of any 
chemical shift changes for RGS4 in the presence of the test and control compounds. In such 
an analysis the observation of a change in the position shape or intensity of a resonance 
indicates perturbation. With the NMR instrumentation employed a shift of half a line width 
in peak position could be reliably detected. Only in NMR spectra taken of RGS4 in the 
presence of compound 1 were any chemical shift perturbations observed indicating that 
compound 1 directly binds to RGS4. Employing the chemical shift assignments for free 
RGS4 (Table 1), the binding site of compound 1 in RGS4 was identified. RGS4 amino acid 
residues VI 0, W13, 117, 120, H23, E24, C25 and T125 exhibited a chemical shift perturbation 
in the presence of compound 1. 

The observed chemical shift perturbations did not arise from a pH change caused by 
addition of compound 1 to the RGS4 solution. ( J H- l5 N HSQC spectra of free RGS4 Xaken 
over a pH range of 5.5-6.5 indicate that none of the amino acid residues listed above was 
sensitive to pH changes over this range). The binding site for compound 1 corresponds to 
residues in the al-a2 region of RGS4 (where the al-a2 region includes the tight turn 
between the two helices). In the three-dimensional structure of RGS4, the binding region is 



positioned on the opposite surface from the G<x binding site. No amino acids residues 
associated with the Gcc binding site exhibited any chemical shift perturbation in the presence 
of compound 1. This indicates that the structure of the RGS4 Ga binding site is unchanged 
in the presence of compound 1. Compound 1 was found to significantly decrease the 
expected GTPase activity of Ga which combined with the fact that compound 1 binds at a 
site distal from the Ga binding site indicates that compound 1 is an allosteric inhibitor of 
RGS4 and that there is an allosteric binding site in the al-a2 region of RGS4. 
It is believed that binding of compound 1 at the allosteric binding site stabilizes RGS4 in the 
free form and effectively locks the RGS4 protein in free form. Compound 1 prevents the 
formation of a hydrogen-bonding network around the Ga T182 binding pocket. 

The solution structural information provided herein, including the secondary and 
tertiary structure of RGS4-core, the RGS4 Ga binding site, the a6 - a7 region, and the 
allosteric binding site in the al-a2 region of RGS4 can all be employed in methods described 
herein and methods well known in the art to identify, select or design candidate agonists and 
antagonists of RGS4 activity which in turn affects G-protein signaling functions in various 
eukaryotic cells and organisms. 

The following examples are provided to further illustrate the invention and are not 
intended to limit the invention, 

EXAMPLES 
The following abbreviations are used herein: 

G-proteins, heterotrimeric guanine nucleotide-binding proteins; RGS4, Regulators of 
G-protein Signaling; G ial , Ga subunit of heterotrimeric G proteins, G fol -AIF 4 ", Ga subunit of 
heterotrimeric G proteins complexed with Mg 2+ , GDP and A1F 4 * stabilized in the transition 
state for GTP hydrolysis, DTT, DL-l,4-Dithiothreitol; GTP, guanosine triphosphate; GDP, 
guanosine diphosphate; NMR, nuclear magnetic resonance; 2D, two-dimensional; 3D, three- 
dimensional; HSQC, heteronuclear single-quantum coherence spectroscopy; HMQC, 
heteronuclear multiple-quantum coherence spectroscopy; TPPI, time-proportional phase 
incrementation; NOE, nuclear Overhauser effect; NOESY, nuclear Overhauser enhanced 
spectroscopy; COSY, correlated spectroscopy; HNHA, amide proton to nitrogen to CaH 
proton correlation; HNHB, amide proton to nitrogen to CpH proton correlation; CT-HCACO, 
constant time CaH proton to a-carbon to carbonyl correlation; HACAHB, CaH proton to a- 
carbon to C{3H proton correlation. 
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Example 1 : Assignment of NMR Peaks and Secondary Structure Determination 

The RGS core domain of RGS4 was expressed in Escherichia coli (Jl 09) using the 
prokaryotic expression vector pQE50 (Qiagin, Valencia, CA). PCR was used to amplify and 
add' a C-terminal hexahis -pro tag to the RGS core (here residues 51-206 of RGS4) and the 
product was ligated between the BamHl and Sail sites of pQESO to give plasmid pRGS4. 
E. coli (BL21(DE3)) containing pRGS4 were grown in LB broth supplemented with 100 
pg/mL ampicillin. An overnight culture was diluted 1 :20 and grown at 37°C to an A 600 of 0.6 
- 0.8 with vigorous shaking. Isopropyl p-D-galactoside (1PTG) was added to a final 
concentration of 1 mM and cultures were shaken for 3 h at 37 °C. The cells were harvested 
by centrifugation (7000 x g) for 15 min. at 4°C, washed with PBS and stored at -70 °C. 

Uniformly (>95%) and 13 C-labeIed recombinant RGS4-core (containing the 166 
amino acid core domain of RGS4 with an N-teiminal methionine and C-terminal hexahis-pro 
tag) was obtained by growing BL21 (DE3) E. coli in defined medium containing 2.0 g/L 
[ 13 C6, 98% +] D-glucose and 1.0 g/L [ l5 N, 98%+] ammonium chloride as sole carbon and 
nitrogen sources, respectively. In addition, the defined medium contained M9 salts, trace 
elements, vitamins and 100 jxg/L ampicillin. Conditions for induction and growth are as 
described above. The recombinant RGS4-core protein was purified using affinity 
chromatography on a 10 mL Ni 2+ column and purified to homogeneity following ion- 
exchange chromatography on Resource S at pH 5.5. Protein was desalted into appropriate 
buffer prior to use. N-terminal amino acid sequencing was performed to confirm protein 
identity and uniform labeling of RGS4-core was confirmed by MALDI-TO mass 
spectrometry (Perceptive Biosystems). 

The NMR samples contained 1 mM of RGS4 manually purified-core protein in a 
buffer containing 50mM K 2 P0 4 , 2mM NaN 3> and 50 mM deuterated DTT, in either 90% 
H2O/ 10% D2O or 100% D2O at pH 6.0. 

AH spectra were recorded at 30 - 35° C on a Brucker AMX-2 600 spectrometer using 
a gradient enhanced triple-resonance 'H/^C/^N probe. For spectra recorded in H 2 0, water 
suppression was achieved with the WATERGATE sequence and water-flip back pulses (23, 
24). Quadrature detection in the indirectly detected dimensions were recorded with States- 
TPPI hypercomplex phase increment (25). Spectra were collected with appropriate refocusing 
delays to allow for 0,0 or -90,180 phase correction. Spectra were processed using the 
NMRPipe software package (28) and analyzed with PIPP {29), NMR Pipe and in a peak 
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sorting program— on a Sun UltralO Workstation. When appropriate, data processing included 
a solvent filter, zero-padding data to a power of two, linear predicting back one data point of 
indirectly acquired data to obtain zero phase corrections, linear prediction of additional points 
for the indirectly acquired dimensions to increase resolution, linear prediction by the means 
of the mirror image technique was used only for constant-time experiments (38). In all cases 
data was processed with a skewed sine-bell apodization function and one zero-filling was 
used in all dimensions. 

The assignments of the ! H, l5 N, l3 CO, and l3 C resonances were based on the following 
experiments: CBCA(CO)NH (62), CBCANH (63), C(CO)NH (64), HC(CO)NH (64), 
HBHA(CO)NH (65), HNCO (66), HCACO (29), HNHA (26), HNCA (67), HCCH^COSY 
(68) and HCCH-TOCSY (69) (for reviews see: Bax et al 1994 and Clore and Gronenbonn, 
1994). The resonance assignments of RGS4 essentially followed the semi-automated 
protocol described previously (37, 70, 71). The accuracy of RGS4-core assignment was 
further confirmed by sequential NOEs in the 1 5 N-edited NOESY-HMQC spectra. Because 
the RGS4 structure is exclusively a -helical, the sequential NH r NH M NOEs were extremely 
useful in completing the RGS4 backbone assignments. l H, 15 N, l3 C AND 13 CO assignments 
for RGS4-core are summarized in Table 1 . 

The backbone ! H, l5 N, 13 CO, and l3 C assignments in Table 1 are essentially complete 
for the RGS4-core. As noted above, the native core sequence was appended to six histidines. 
The last five histidines were the only unassigned residues in the protein. The ability to obtain 
the complete assignments for RGS4-core implies a well-packed ordered structure. The side- 
chain assignments are also nearly complete; the majority of missing information is in residues 
with long side-chains which are potentially solvent exposed. 

The secondary structure of the RGS4-core (summarized in Fig. 1) is based on 
characteristic NOE data involving the NH, Ha and HP protons from 15 N-edited NOESY- 
HMQC and I3 C-edited NOESY-HMQC spectra, 33 HNa coupling constants from HNHA, 
slowly exchanging NH protons and 13 Ca and l3 C(i secondary chemical shifts (for reviews see: 
(56) and (78). It was determined that the RGS4-core solution NMR was composed of seven 
helical regions corresponding to residues 7-12(al);17-36(a2); 40-53(a3); 61-71(a4); 86-95 
(a5); 105-125 (a6); and 128-132 (al). The RGS4-core overall fold is essentially comprised 
of two 4-helix bundles with the long helical region a6 part of both bundles.. A distinct 
difference in the RGS4-core secondary structure in solution from the x-ray structure of the 
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RGS4-Gial complex was unexpectedly observed at the C-terminus. The x-ray structure 
indicates that residues 104-1 16 and 119-129 are helical where only residues V5 to T132 are 
observed. The solution NMR structure indicates that residues 105-125 and 128-132 are 
helical and residues P134-H166 appear, in view of the sharp line-widths observed, to be 
extremely mobile. The differences in secondary structure between the x-ray crystal structure 
and that of free RGS4-core suggest a conformational change in RGS4 on binding to Get. 

Example 2: Three-Dimensional Structure Determination for RGS4-core 

RGS4-core was prepared, purified and uniformly labeled as in Example 1 . NMR 
samples were prepared and spectral data accumulated as indicated in Example 1. 

The RGS4 structure is based on the following series of spectra: HNHA (26), HNHB 
(27), 3D long-range ^C- 13 C correlation (28), coupled CT-HCACO (29, 30), HACAHB- 
COSY (31), 3D 15 N- (32, 33) and 13 C-edited NOESY (35, 37) experiments. The ^N-edited 
NOESY, and I3 C-edited NOESY experiments were collected with 100 msec and 120 msec 
and mixing times, respectively. 

Spectra were processed using the NMRPipe software package (36) and analyzed with 
PIPP (37) on a Sun UltralO Workstation. When appropriate, data processing included a 
solvent filter, zero-padding data to a power of two, linear predicting back one data point of 
indirectly acquired data to obtain zero phase corrections, linear prediction of additional points 
for the indirectly acquired dimensions to increase resolution. Linear prediction by the means 
of the mirror image technique was used only for constant-time experiments (38). In all cases, 
data were processed with a skewed sine-bell apodization function and one zero-filling was 
used in all dimensions. 
Interproton Distance Restraints. 

The NOEs assigned from 3D I3 C~edited NOESY and 3D E5 N-edited NOESY 
experiments were classified into strong, medium, weak and very weak corresponding to 
interproton distance restraints of 1.8-2,7 A (1.8-2.9 A for NOEs involving NH protons), 1.8- 
3.3 A (1.8-3.5 A for NOEs involving NH protons), 1.8-5.0 A, and 3.0-6.0 A, respectively (39, 
40). Upper distance limits for distances involving methyl protons and non-stereospecifically 
assigned methylene protons were corrected appropriately for center averaging (41). 
Torsion Angle Restraints and Stereospecific Assignments. 

The P-methylene stereospecific assignments and xi torsion angle restraints were 
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obtained primarily from a qualitative estimate of the magnitude of 3 J a p coupling 
constants from the HACAHB-COSY experiment (31) and 3 J^p coupling constants from the 
HNHB experiment (27). Further support for the assignments was obtained from approximate 
distance restraints for intraresidue NOEs involving NH, CaH, and CpH protons (42). 

The <f> and \[/ torsion angle restraints were obtained from ^Jnhcc coupling constants 
measured from the relative intensity of Ha crosspeaks to the NH diagonal in the HNHA 
experiment (26), from chemical shift analysis using the TALOS program (43) and from 
consistency with distance restraints for intraresidue and sequential NOEs involving NH, 
CaH, and CpH protons. 1 JcaHa coupling constants obtained from a coupled 3D CT- 
HCACO spectrum were used to ascertain the presence of non-glycine residues with positive 
backbone torsion angles (30). The presence of a *JcaHa coupling constant greater then 130 
Hz allowed for a minimum <f> restraint of -2° to -178°. 

The He and Leu X2 torsion angle restraints and the stereospecific assignments for 
leucine methyl groups were determined from 3 JcaC5 coupling constants obtained from the 
relative intensity of Ca and C5 cross peaks in a 3D long-range l3 C- 13 C NMR correlation 
spectrum (44), in conjunction with the relative intensities of intraresidue NOEs (45), 
Stereospecific assignments for valine methyl groups were determined based on the relative 
intensity of intraresidue NH-CyH and CaH-CyH NOEs as described by Zuiderweg et ah 
(1985) (46). The minimum ranges employed for the §, and % torsion angle restraints were 
db 30°, i 50°, and ± 20° respectively (47). 
Structure Calculations 

► 

The structures were calculated using the hybrid distance geometry-dynamical 
simulated annealing method of Nilges et aL (1988) (48) with minor modifications (49) using 
the program XPLOR (50), adapted to incorporate pseudopotentials for 3 J NHa coupling 
constants (51), secondary * 3 Ca/ 13 CP chemical shift restraints (52) and a conformational 
database potential (53, 54). The target function that is minimized during restrained 
minimization and simulated annealing comprises only quadratic harmonic terms for covalent 
geometry, ^J^^ coupling constants and secondary n Ca/ I3 CP chemical shift restraints, square- 
well quadratic potentials for the experimental distance and torsion angle restraints, and a 
quartic van der Waals term for non-bonded contacts. All peptide bonds were constrained to 
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be planar and trans. There were no hydrogen-bonding, electrostatic, or 6-12 Lennard-Jones 
empirical potential energy terms in the target function. 
Analysis of a T-J82 Binding Site on RGS4-core. 

- The overall appearance of the NMR structure in the area of the proposed Tl 82 (of 
Ga) binding site is one of great interest. To obtain a more quantitative measurement of the 
differences in accessibility between the free RGS4 NMR structure and the x-ray structure of 
the RGS4-G iat complex, MOLCAD (commercially available from TRIPOST) surfaces were 
calculated for both structures and the surface area of each was measured. 

The x-ray structure of the RGS4-G ictl complex (AGR1) was read into SYBYL 
(Tripos) and all substructures except chain E (RGS4) were deleted. Additionally, all waters 
were deleted. Polar hydrogens were added and optimized using the Kollman United Atom 
force field. This was followed by addition of all the remaining hydrogens. MOLCAD was 
then used to generate a surface for all residues thought to be involved in binding of T182 
(Ga-binding site). These RGS4-core residues include, 121, 127, F30, F33, L34, E37, S39, 
N42, 143, W46, II 10, LI 13, Ml 14, Dl 17, SI 18, R121. The surface area was calculated 
based on the MOLCAD surface. MOLCAD was also used to calculate the surface area for 
the identical residues of the free RGS4 NMR structure. The surface area for the free RGS4 
NMR structure was calculated to be 404.56 A 2 . The surface area for the crystal structure was 
calculated to be 321.88 A 2 . The difference in surface area of 82.67 A 2 , is an approximate 20% 
change in surface area between the two structures. A MOLCAD surface generated on the 
methyl and hydroxyl groups of T182 of Go; has a surface area of 57.72 A 2 . 

Example 3 : Identification of an Allosteric Binding Site in RGS4-Core 
Bead Precipitation Assay for Inhibition of RGS Binding to Go: 

Radiolabled [ 35 S]-Gailwas synthesized in a rabbit reticulocyte lysate in vitro 
translation reaaction (Promega, Madison, Wl Cat. NO. 14960) programmed with in vitro 
transcribed cRNA preparations (Promega, Cat. No. PI 290). Affinity-purified GST-RGS4 
core (lOOng, about 25 nM final concentration) is incubated with 17.5 ^L glutathione-^ 
Sepharose 4B bead (Amersham Pharmacia, Piscataway, NJ, Cat. NO. 17-0756-01) slurry in 
100^L binding buffer (IX PBS, ImM MgCl 2 , ImM DTT, 1% BSA) in a 96-weII microtube 
assay plate. Approximately 300^M of test compound (about O.lmg/mL final concentration, 
either as a mixture or individual compound) is added to each well and incubated at 4°C for 30 
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min. Approximately 50,000-100,000 cpm (?) 35 S]-Gail in lOO^iL assay buffer (IX PBS, 
ImMMgCVlOfiMGDP, ImMDTT, 30 fiM A1C1 3 , 1% BSA, 500 fiM NaF) is added to the 
reaction and incubated at 4°C for 30 min. The resulting assay sample has a final 
concentration of about 1-3 nM activated Gail. Reaction plates were centrifuged at lOOOxg 
for 3 min, and the supernatant aspirated. Beads were washed 2x by resuspension in 200 \ih 
binding buffer followed by centrifugation. Bound [ 35 S]-Gailis eluted from the bead pellets 
by resuspending them in lOO^iL 1% SDS. Elutates are either counted in 4 mL scintillation 
fluid or subjected to gel electrophoresis. Random small molecules can be evaluated in the 
assay described using a compressed library wherein a plurality of test compounds are 
combined in a single well (e.g., 10 compounds/well for 3000 primary assays tests 30,000 test 
compounds). Mixtures of test compounds that exhibited a greater than 50% decrease in 
precipitated radioactivity were confirmed by re-screening in an identical format. Combi- 
wells (here 10 test compounds/well) that tested positive in both assays were deconvoluted and 
the individual compounds were tested individually in an identical bead precipitation assay. 
Compounds that demonstrated the requisite decrease (about 50% or more) in precipitated 
radioactivity were further tested to confirm that the decrease in precipitated radioactivity was 
dependent on the RGS4-Ga interaction and not due to spurious activity of the test compound. 
In these cases, the assay precipitate was analyzed by gel electrophoresis to confirm the 
presence of RGS4 in the precipitate. 
l H' !5 NHSOC Chemical Shift Perturbation 

The RGS4 NMR samples contained 0.3mM of RGS4-core protein in a sample buffer 
(50mM KP0 4 , 2mM NaN 3 , and 50mM deuterated DTT in 90% H 2 O/10%D 2 O at pH 6.0). 
Test compounds were added to the sample in 10-fold molar excess. 2D 1H-15N HSQC 
spectra for free RGS4 and RGS4 in the presence of test compounds were collected over a pH 
titration range of 5.5-6.5. The spectral width in the indirectly detected 15 N dimension was 
30.00 ppm with the carrier position at 1 19.1 ppm. Spectral width in the acquisition 
dimension was 13.44 ppm with the carrier at the water frequency (4.73 ppm). The number of 
points acquired in the two dimensions was 256 complex in F1( 15 N) and 1024 real in F2( ] H)- 
All spectra were recorded at 35°C on a Brucker AMX-2 600 spectrometer using a gradient 
enhanced triple resonance 'H/^C/^N probe. Water suppression was achieved in the 
indirectly detected dimension with the WATERGATE sequence and water-flip back pulses 
(23, 24). Quadrature detection in the indirectly detected dimensions were recorded with 
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States-TPPI hypercomplex phase increment (25). Spectra were collected with appropriate 
refocusing delays to allow for 0,0 phase correction, processed using the NMRPipe software 
package (36)) and analyzed with PEPP (37) on a Sun Ultra 10 Workstation. Data processing 
included a solvent filter, a skewed sine-bell apodization function and one zero-filing in all 
dimensions. 

GTPase Functional Assay A single-turnover GTP-ase assay of G-protein a subunits was 
used. In this assay GTPase-induced hydrolysis of [y- 32 P]-GTP results in precipitation of 
radiolabel as 32 Pi. Unhydrolyzed [y- 32 P]-GTP is separated from precipitated label which is 
then counted. Precipitated label 32 Pi is directly proportional to the amount of [y- 32 P]-GTP 
hydrolyzed and to the activity of the G<x GTPase . 

Purified [y-32P]-GTP bound Get is prepared by incubating Ga (2jiM) with [y-32P]- 
GTP (2jiM) in a reaction buffer (total volume 30 jiL), lOmM Hepes (pH 8.0), 5 mM EDTA, 
2mM DTT, 0.05% C12E10 (Lubrol, ICN Biomedicals, Inc., Aurora, OH), 10 iig/mL BSA) 
for 30 min at 30° C. Unbound [y- 32 P]GTP is removed using a gel filtration column (Centri- 
Sep, Princeton Separations, Princeton, NJ) according to the manufacturers directions. The 
eluate containing [y- 32 P]GTP bound Ga is collected and the protein is recovered (typically up 
to about 80-90%) after centrifugation at 2000ipm for 2 min at 4° C. 

All steps of the assay are performed at 4° C. The purified [y- 32 P]GTP bound Ga 
obtained above is added to 500 fiL of reaction buffer (as above) and separated in to eight 
50jiL samples (a zero time control (no initiation) and seven assay time points). The reaction 
is initiated by adding 10jiL of 1M MgCl 2 and IOjiL of lOmM GTP to the seven assay 
samples. After 10, 20, 30, 40, 60, 90, and 120 seconds, respectively, 750_ of stop buffer 
(50 mM NaP0 4 (pH 3.0), 5% activated charcoal) is added to one of the assay samples. The 
control and samples are then centrifiiged at 100,000 rpm for 10 min to precipitate the charcoal 
and 500 of supernatant is remove to assay radiolabel present GTPase activity is 
expressed as the amount of free [ 32 P]-phosphate released from [y- 32 P]GTP. Phosphate release 
(finol) - radioactivity (zero time control- time assay) (counts)/specific activity of [y- 
32 P]GTR 

GTPase activity of Gai in the presence of RGS4-GST fusion protein was determined 
as described above where GTP hydrolysis by 100 nM Gai was initiated by the addition of 
MgCI 2 in the presence and absence of lOOnM RGS4-GST protein. GTP hydrolysis at the 
indicated time points was calculated as the amount of 32 Pi released (in finol). The dose- 



dependent effect of RGS4-GST protein on the hydrolysis of GTP-Gai was measured as 
described above in the presence or absence of 10 nM or lOOnM RGS4-GST protein, * 

The effects of test compounds are evaluated for modification of the activity of the 
RGS4 core domain. The RGS4 core protein was generated as a GST-RGS4core fusion using 
standard molecular techniques. Briefly, the core region of RGS4 was obtained using PCR to 
generate a cDNA fragment encoding amino acid 5 i (val) to the C-terminal end of the 
protein, amino acid 206 (ala). The 5' forward amplification primer contained an embedded 
BarnHI restriction site, followed by nucleotides encoding a flexible linker, Gly-Ser-GIy-Ser, 
prior to the Val residue of rat RGS4. The 3 T reverse amplification primer contained a stop 
codon, followed by an embedded BamHI site. - The amplimers were used with pWE2RGS4 
(Shuey et aL, 1998 (84) ) as template to generate a PCR product of approximately 625 base 
pairs. This PCR product was BamHI digested, purified and ligated in the BamHI site of 
pGEX-2T (Arnmersham Pharmacia, Piscataway NJ) to generate pGST-RGS4c recombinant 
plasmid. Plasmid was tranfected into bacterial cells, and DNA prepared by standard methods, 
and confirmed by sequence analysis. GST-RGS4c fusion protein was generated and 
purified according to manufacturer suggestions for expression using the pGEX-2T vector. 

To measure the effect of test compounds on the activity of RGS4 core domain, RGS4- 
GST fusion protein (1.6 jiM) is incubated with test compound (or mixtures of test 
compounds) (30|iM-40^iM each) or DMSO for 1 hr at 30°C Thereafter, GTPase activity of 
Gcri (lOOnM) is measured in the presence or absence of the RGS4-GST treated with the test 
compound (lOOnM). Each assay is replicated at least three-times. RGS4-GST was treated 
with Compound I at 30 jiM and inhibited the GTPase activity of IIGS by 30%; while RGS4- 
GST treated with Compound 1 at 300 \xM inhibited GTPase activity in comparison to a 
DMSO control 

Those of ordinary skill in the art will appreciate that reagents, methods, procedures 
and techniques other than those specifically disclosed herein are known in the art and can be 
readily employed or adapted to the practice of this invention to achieve the results of this 
invention. All such art-known functionally equivalent reagents, methods, procedures and 
techniques are intended to be encompassed by this invention. AH references cited herein are 
incorporated by reference herein in their entirety to the extent that they are not inconsistent 
with the disclosure herein. 



Table I 15 N, 13 Q l3 CO and *H resonance assignments for RGS4 at pH 6.0 and 30°C a 



Residue 


N 


CO 


Ca 


cp 


Others 


MI 


-(-) 


176.1 


55.9 (4.30) 


29.4(2.10,2.00) 


Cy33.9 (2.38);Ce21.7 (0.36) 


R2 


123 0 (8 34} 


177.2 


56.7 (4.29) 


33.0(1.84,1.77) 


Cy24.8 (I.45);C5 29.2 (1.71);C£ 42 2 (3 00) 


G3 


1 10.4 (8.40) 


173.7 


45.3 (4.00) 






S4 


115.7 (8.23) 


174.0 


58.1 (4.59) 


64.2 (3.90) 




V5 


121.6 (8.20) 


174.9 


61.2(4.38) 


33.6 (1.99) 


Cy21.7 (0.98)21.9 (0.89) 


S6 


122.3 (8.56) 


175.1 


57.5 (4.51) 


65.3 (4.33,4.02) 




07 


121 5 f8 95} 


178 2 


58 5 f3 80} 


28 3 (2 02) 


Cy 34 3 (2 39) 


F& 


1 1 8 9 (8 48} 


178 8 


59 7 f3 85} 


29 H2 01 1 94} 


Cy36,6 f2 29) 




1 90 ^ f7 4^ 


177 O 


Q n 77> 


?9 4 f2 04 1 72} 


Cv 36 4 f 7 23} 




llO.O ^/.lO) 


ioU.O 


oj.u ^j.oyj 


^ I £ ( 1 




^ JVI I 


1ZZ.U ^ I ,OOj 


17Q £ 








J-Jf"; 










Co ±0 1 H 01 9 7<^ 




IzU.vJ {t.jy} 


loU.4 




ii 7 n n i oq^ 














r^t* /to n /o qq\ 


£ : 

-T-T M 1 

:if:W13 


120.7 (7.84) 


176.7 


57.2 (4.63) 


29.0 (i.4o) 


^*St n< /4 f£ Cn\. xTo1 no 'i/Q £GV " 
Col Izo.^ (o.O/), iN£i Izy.ov."- 0 "^ 


y ; 










CC,Z lli.y(o.20); CT|z iz4.U(o.5iy\ 












121.3(7.18) 


SA14 


117.5 (7.48) 


176.9 


52.3 (4.36) 


18.6 (L52) 




*E15 


117.5 (7.80). 


177.1 


57.6 (4.29) 


30.9 (2.16) 


Cy^6.6 (2.48,2.25) 


HIS 16 


1 11.6 (7.02) 


173.6 


57.5 (4.50) 


63.0 (4.22,3.93) 




C;L17 


125.4 (8.36) 


177.5 


57.1 (3.17) 


39.0 (0.97) 


Cy26.2 (0.65);C5 25.9 (-0.48);23.9 (-U.Osj 


QE18 


117.2(8.68) 


177.7 


60.1 (3.63) 


29.3 (1.98,1.91) 


Cy 36.5 (2.19) 


M.N19 


116.0 (7.53) 


176.9 


55.3 (4.37) 


38.4 (3.20,2.78) 




L20 


119.5(7.01) 


176.4 


58.2 (3.50) 


41.8(1.79,1.63) 


Cy24.7 (0.52);C5 26.9 (-0.03);24.9 (0.33) 


121 


105.6 (7.37) 


175.1 


63.3 (3.65) 


37.1 (1.61) 


Cym 1 9.2(0.33);Cy (0.70,0.8 1 ); 












C8 123(-0.62) 


N22 


115.7(7.10) 


173.8 


53.9 (4.76) 


39.5 (2.88) 




H23 


121.6 (7.32) 


176.0 


56.4 (4.71) 


33.9 (3.31,2.98) 


C52 138.6 (8.15) 


E24 


128.2 (9.04) 


179.5 


60.6 (4.07) 


29.8 (2.09) 


Cy36.1 (2.44^.34) 


C25 


121.4(10.57) 


176.9 


61.8 (4.46) 


27.3 (337) 




G26 


114.1 (7.36) 




46.6 (2.86,2 


.45) 


/ 


L27 


122.8 (8.66) 


178.3 


58.2 (3.76) 


41.4(1.72,1.65) 


Cy 27.1 (1.60);C5 25.6 (1.05); 23.5 (1.04) 


A28 


118.3(7.29) 


180.8 


55 J (4.06) 


18.4(1.60) 




A29 


122.8 (7.85) 


178.9 


55.3 (4.51) 


18.2(1.84) 
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F30 


120.0 (8.51) 


177.8 


59.1 (3.94) 


K31 


118.9 (9.02) 


177.5 


60.6 (3.59) 


A32 


' 121.6(7.82) 


180.3 


55.3 (3.98) 


F33 


121 5 (7.82) 


177.8 


59.9 (4.39) 


L34 


122.0 (8.57) 


180.0 


57.6 (3.34) 


K35 


120.9 (8.60) 


180.1 


59.6 (3.90) 


S36 


116.7 (7.43) 


174.1 


61.1 (4.09) 




118 0(6.86) 


175.3 


55.3 (4.28) 


Y3& 


118 8 (7 60) 


175.8 


58.7 (4.37) 


S39 


113.0 (8.05) 


175.5 


57.0 (5.04) 


F40 


127 2 (9 07) 


175.9 


58.2 (3.86) 


"PA1 


122 3 flO 221 


177 2 


60.9 (4.25) 


N47 


116 4 (7 67) 


176.9 


56.7 (4.52) 


143 


117.6 (6.99) 


176.2 


58.9 (4.52) 


D44 


126.2 (8.38) 


179.6 


57.6 (4.59) 


F45 


123.8 (8.40) 


176.5 


60.2 (4.37) 


W46 


122.3 (7.98) 


177.7 


64.9 (4.39) 


14/ 


1 1 5? 9 (Q 041 


178 4 


65 1 (3 37) 


. P; 


117 4(7 911 


176 6 


62*8 (4.22) 


as: V^-^ V 


120 4 (7.53) 


179.6 


64.0 (3.90) 




120 1 f7 39) 


179.1 


58.8 (3.58) 


:i "Is' 


1 18 3 (1 931 


180.0 


59.0 (3.84) 


bss; 


123 0 (8 1 1) 


175.6 


6L2 (3.95) 


s _s i iVJJ 


112 6 (7 43) 


177.1 


58.2 (3.89) 


a 1SJ4 


117 *\ f7 1 Rl 


176 9 


56 1 (4 17) 


w 155 




176 ft 


64 0 T3 581 


Ms 1 Koo 


tn< < (9. 
1/O.0 \Q.jy) 


17A 7 
1 /O. / 


3 (4 341 


; ^ O J / 


1 16 2 f7 351 




54.6 (4.98) 


^ iOo 


( \ 


17R S 
X / 0.0 


65 0 (4 191 




in 1 /o 1 <\ 

I IZ.J {Q.iJ) 


17^ ^ 


60 5 T4 321 


r™ K60 


119.7 (/•//) 


ITT < 

1 / / .o 


*A 1 f± 3^ 
JU.J ^.^O^ 


L61 


120.4 (7..49) 


1 TO < 

178.5 


JO. / 


S62 


112./ (0.0/) 




n ^4 381 




( \ 


ISO 0 


66 0 (4 391 




1 1 K 0 f7 081 


178 0 


57 9 (4.36) 


A65 


122.5 (9.23) 


179.9 


55.5 (4.19) 


K66 


116.9(8.40) 


178.4 


60.2 (3.99) 


K67 


119 5 (7.52) 


179.6 


59.9(4.14) 


168 


120 9 (8.24) 


178.4 


65.9 (3.87) 


Y69 


120.8 (9.15) 


178.6 


62.4 (3.99) 


N70 


117.5(8.77) 


176.0 


55.7 (4.42) 


E71 


118.4(7.88) 


176.4 


58.8 (4.00) 



39.5(2.62,2.38) C5-(6.34) 

32.4 (1.94,1.74) Cy 26.0 (1.70,1.44);C5 - (1.63);Ce - (2.96) 
17.8(1.61) 

39.2 (3.18,2.00) C5 - (6.78);C£ - (7.18) 

40.2 (0.88) Cy26.2 (1.0I);C8 21.8 (0.57); 26.3 (0.19) 
32. 1 (1.79) Cy24.9 ( 1 .54, 1 .39);Cs - (2.93) 

63.1 (4.04,2.38) 

29.9 (1.98,1.45) Cy35.9 (1.66) 

35.5 (3.27,3.09) C5 133.4 (7.04);Cs 1 18.2 (6.84) 

64.5 (3.833-71) 

30.2(1.96,1.75) Cy36.9(2.20) 

27.7 (2.13,1.98) Cy35.6 (2.37) 

38.8 (2.68) 

38.3 (1.58) Cym 20.7 (1.05); C5 14.6 (0.92) 

39.1 (2.85) 

39.0 (2.98,2.50) C5 - (6.67) 

29.9 (3.78,3.21) Nsl 130.7 (10.68);C?2 1 15.1 (7.25); 

Ct]2 125.8 (7.01);C£3 122.9(7.01); 
Ce3 119.9(7.35) 

38.6 (2.02) Cym 17.0 (0.95) ;Cy 29.4 (2.24,1.46); 

C5 14.3 (1.07) 

- (3.89,3.67) 

- (3.61) 

29.0(1.65) Cy35.2(-) 

29.7(2.21,1.95) Cy 36.3 (2.15) 

38.7 (3.302-95) CS - (7.03);Cs 1 18.5 (7.15) 

32.2 (1.86,1.76) Cy - (1.54,1.39) 

33.1 (1.98,1.71) Cy 25.4 (-);C5 29.4 (1.60);Ce 42.2 (-) 

37.2 (1.74) Cym 19.4 (0.75);Cy 23.3 (0.35) 

33.6 (1.95,1.72) Cy 24.6 (1.40);C5 28.7 (1.64);Cs 42.1 (3.00) 

64.0 (3.79) 

32.1 (2.45,2.06) Cy 27.5 (2.17,2.06);C5 51.4 (4.1 1,3.98) 

62.7 (3.90) 

33.2 (2.03,1 .89) Cy 25.7 (-);C5 29.0 (-);Cs 42.4 (-) 

41.3 (1.82,1.71) Cy 26.8 (1.67);C5 25.9 (0.74);23.2 (0.35) 

60.8 (4.08) 
30.8 (2.42,1.95) 
31.9(2.26) 
17.7 (1.76) 

32.4 (1.93) Cy 25.8 (1.43);C5 29.2 (1.68);Cs 42.1 (3.00) 
32.7 (2.09) Cy 25.2 (1 .66,1.52) 

38.3 (2.08) Cym 18.3 (0.93); CS - 0.30 

39.5 (3.52,3.26) C5 133.4 (7.07);Cs 1 17.9 (6r78) 

38.4 (2.99,2.78) Ny 1 12.0 (7.59,6.97) 

30.6 (1.76,1.37) Cy35.9 (1.80,1.09) 
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F72 
i / x» 


1 10 4 (7.76) 


175.1 


58.5 (4.95) 


43.4 (2.77) 


C5 133.6 (7.75):C£ 133.1 (7.19);CC - (6 891 


173 


116 5 (7 59) 


175.3 


61.3 (4.05) 


38.0(1.59) 


Cy27.2 n.43,1.03);CYm 18.5 (0.79): 












C8 12 0 (0 76) 


S74 


113 4(7 43) 


174.6 


57.7 (3.89) 


64.1 (3.44) 




V75 


120 6 (8 20) 


177 1 


64 4 (3 99) 


31 3 (2 18) 


Cy20.3 (I 02)^1 1 fl 02) 


076 


1 18 6 (8 10) 


175 4 


55 7 (4 29) 


78 ? (2 23 1 87) 


Cy34 6f2 30) 


All 


124 in 54) 


178.7 


52.8 (4.11) 


18.8 (1.10) 




T78 


114 8 {1 99) 


175.6 


64.0 (4.05) 


68.8 (4.11) 


Cy 22.6 (1.27) 


K79 


173 8 (1 89^ 


173 4 


54 6 (4 47) 


33 HI 76 1 39) 


Cy25 2(1 27)*C8 29 7(1 64 1 51)* 












Cp 42 0 (2 84^ 




1 1*7 9 f"7 

11/ -<5 ^ /.iu; 


T7^ 7 




-in n n 5?7^ 


fv 14 />Q 7 1 0"i 


V81 


» irt /O 01\ 

1 19.3 (8.22) 


1 /O.J 


jy.D (400/ 


oo /i /o (\\\ 




N82 


t oo < /a ao\ 
122.5 (y.uy) 


I /i. / 




^7 ^ CI 5?0 7 A'^^ 

j/.j ^z.t>y,z.oj) 




L83 


122.7 (7.43) 


176.1 


53.1 (4.52) 


4o.J (I.4y,I.J4) 


cyzo.4 (-); to Z4.5 ^u.y4) - o) 


D84 


1 OA 1 00\ 

1ZU.5 (o.Z/) 


1/0.3 




a t 7 f o 7 70^ 




<^o< 


I 10. 0 ^o.y 3} 


T7(^ 7 
1 /O. / 




- (AO)} 




LOU 


171 4 f8 36) 


177 9 


62 4 (4 30) 


76 1 (3 07) 




AO/ 


120 2 (8 35"! 

X X, V . X. ^U.»f 


178 5 


67 0 f 4 06) 


67 0 (4 06) 


Cy^2 7 (1 22) 




17^ S ^Vt 


17R 7 


61 ^ n 73^ 


29 9 n 98 1 9H 

£~y~y .7 1 y 


Cy- H 51 1 67V C543 3 (3 10) 


to? 


ni i/o /iQ\ 
1ZI.1 y^o/ty) 


17Q ^ 




79 7 T7 17 7 ftf^ 


Pv36 7 T7 39> 


^ £90 


121.2 (o.44) 


1 o© < 
1 fO.D 


jy.3 (4.U1 / 


on /t n 17^ 


PvlC A fO /ffj\ 

j.o ^z.'+u; 




115.1 (8.10) 


175.8 


67.8 (3.91) 


AO A tA A^\ 

68.4 (4.42) 


Cy22.3 (1.45) 




i i/; if nON 

1 16.6 (o.iZ) 


1 /O.O 


oz.4 ^.yu; 


fn i (a no ^ gqa 
OA. i x^Ajyjjjyy) 




.JL , Ky5 


1 OO 1 /O 
1ZZ.5 (/.o4) 


i /y.o 


jy.j ^.uj } 




r*v77 q n f?9 t ^nvr , 8 43 5 n 73^ 


^ in y 4 




1 7(^ 7 
1 / u. / 


J'-r.y \y-.JJ J 






: iViy J 


I 1 D* i \ /JHJ) 


17^ 0 


< < 1 (A 07^ 


31 3 H 83 1 77^ 


Cv31 7 T7 20 1 04V C£ 15 3 CI 67) 


HJ t qa 
£ f i -L90 


iiQon no\ 


175? A 
1 /O.H- 


S7 7 (A 07^ 


41 5 fl R0 1 "SV> 


r-v77 n n ^n^-r^ 23 0 ro 85V 24 8 (0 95) 


rryr 
S ^9 / 


1 10. 0 ^ / ,ZU) 




<o o //t ^0\ 




pv'lA 1 T9 im 


•ass. DOS? 


~ {') 


i / o.y 




37 T (1 76 1 


r*v77 0 f7 OSVr*^ 49 9 G 59) 


g.i iyy 


1 1^. / ^.^J; 


1 fy.L 


A (A 1Q\ 


77 > T4 


Pv71 1 H 301 






1 T< 1 

1 / O.I 


£s\ o //t oo^ 


-3Q A T1 07^ 


fill iy.l \L.W>),\~j £q.j \,i.*r/ t i.jyj, 












v^o i4.o ^u.yuj 


w TlOl 


109.8 (7.61) 


176.2 


61.5 (4.57) 


AO 1 A"i\ 


O 1 O* /" t' OHA 




12.2.. / (/.4o; 


TO*/f < 


AO A /"2 S'T* 


Zo.y V-5-I-JtZ.JIJ 




F103 


1 1 O O / O TON 

1 12.8 C8.78) 


1 *7 C O 

175.2 


5 /.o (4.95) 


Af\1 (1 Ol O <*7% 
4U. / (3. / i,Z.D / J 


/A TAA 
CO - vo. /O) 


D104 


-tOO /O TON 

122.0 (7.72) 


179.1 


CO A //4 00\ 

58.9 (4.28) 


iCi Q fl CO O 

iy.o (z.oz,2.5y) 




TT 1 A< 


i 19.9 (o./j) 


i / J.U 


en /+ m i A^ 
Dy.4 1^4.10) 


OO A /O /M O 1 OA 

Zo.O ^Z.f4-,Z.1Z) 




a i oa 


no Q ro 


I / O.O 


JJ.^T ^.Wj J 


TS? A /I A^ 




/~\ i AO 


i i 9 7 /O 

i xo. / ^y.^oj 




AA ^ QCY\ 


77 0 M 


rV3^ 7 M 


JS.IU5 


1ZU. i ^o,l I ^ 


170 ^ 


<o/;M7n 


37 4 (1 18 2 07^ 


Cv25 2 fl 46VC5 29 8 (L69):C£ 42.2 (3.00) 


"K*i no 
iviuy 


I lo.O \_O.u0 / 




SQ 0 77^ 
-jy+\j v^*^ ' y 


31 8 f 2 27 7 04^ 


Cy24 9 Q 44VC5 29.1 (L7L 1.81); 












Ce 42 0 (3 00) 


1110 


121.3 (8.30) 


177.9 


61.5(4.22) 


36.0 (2.44) 


Cr 29.0 (2.06,1.43);Cym 19.6 (1.38); 












C5 10.0 (0.76) 


Fill 


124.5 (9.22) 


176.8 


63.2 (3.80) 


39.2 (3.5U-21) 


C5 13L9(6.87);Ce-(7.04) 
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N112 


117.2 (8.51) 


177.5 


56.3 (4.50) 


38.6 (3.08,2.90) 




LI 13 


121.5 (7.95) 


180.3 


58.3 (4.21) 


42.3 (1.99) 




Ml 14 


118.6 (8.22) 


177.0 


60.0 (4.28) 


35.0 (1.82,1.68) 




El 15 


120.2 (8.46) 


177.3 


60.4 (3.49) 


29.6(1.58) 




K116 


H3.5 (7.57) 


176.9 


57.5 (4.26) 


33.1 (1.98) 




D117 


116.6(7.62) 


176.6 


55.8 (5.08) 


42.8 (2.92,2.84) 




S118 


117.2 (8.45) 


176.2 


63.0 (4.52) 


63.6 (4.33,4.02) 




Y119 


121.8 (8.95) 


175.5 


60.4 (4.27) 


39.2 (2.81,2.00) 




R120 


114.4(7.11) 


179.8 


58.3 (3.52) 


28.7 (1.96,1.80) 




R121 


117.5(7.63) 


179.4 


59.6 (3.97) 


30.3 (2.07) 




F122 


124.0 (8.57) 


178.4 


60.0 (332) 


38.3 (2.89,2.76) 




L123 


1 17.7 (7.05) 


176.3 


56.3 (3.27) 


41.4(1.11,1.06) 




K124 


114.3 (6.72) 


175.5 


54.8 (4.42) 


33.0 (2.09,1.63) 




S125 


118.2 (7.83) 


175.6 


57.9 (4.79) 


67.1 (4.52) 




R126 


12 LI (9.39) 


176.5 


58.2 (3.92) 


28.4 (1.81) 




F127 


115.5 (7.47) 


176.0 


60.7 (4.21) 


39.5 (3.42,3.27) 




Y128 


1 14.2 (6.68) 


177.2 


59.3 (4.40) 


39.8 (2.39) 




T 179 


116 8 (8.34) 


180.4 


57.6 (3.71) 


41.7(1.41,1.37) 




D13Q 


1190(8.61) 


177.8 


56.8 (4.38) 


39.8 (2.63) 






)M 5 (1 46) 


177.8 


55.7 (4.18) 


42.1 (1.83,1.43) 








173 9 

J. / — t - J 


616 (4.26) 


70.1 (4.09) 




IN i 33 


i to 9 r7 Am 


173 0 


51.3 (4.95) 


38.8 (2.78,2.62) 




Fli4 




177 4 

lit 


63 7 (4 43) 


32.1 (2.27,1.96) 




3135 


i 13.3 {O.JA} 


174 Q 




63.7 (4.42,3.88) 




o 1 Jo 




174 7 


58 5 (4 47) 


63.8 (3.90,3.13) 


1 s"; 




170 7 f& 27") 


175 1 


58.7 (4.55) 


28.1 (2.95) 






1 1 1 5 (8 43) 


173.8 


45.4 (3.96) 


-(-) 


yi 


A139 


124.0 (8.10) 


178.0 


52.6 (4.30) 


19.4 (.1.43) 


E 1 


E140 


120.0 (8.48) 


176.8 


57.0 (4.22) 


30.0 (2.01) 


IB 


1 41 




176 7 


56.5 (4.27) 


32.9 (1.80) 




OT47 


171 I f 5? 77^ 


173 1 

A t ~fJ •> a> 


55.0 (4.16) 


33.1 (2.21) 




K143 


120.7 (8.27) 


176.3 


56.7 (4.42) 


30.8 (1.92,1.86) 




G144 


111.1 (8.57) 


176.1 


45.4 (3.97) 


-(-) 




A 145 


124 0 (8 15) 


177.9 • 


52.7 (4.30) 


19.5(1.43) 




JvI40 


170 Q 15^ 

1ZU.7 ^O.-J-Jy 




56 3 (4 35) 


33.1 (1.81,1.45) 




SI4/ 


1 1 /.J ^ojy; 


T74 5? 


5R 2 f3 94) 


64 1 (4.50,3.90) 






I IO. 3 |^O.J7; 


174 5 


58.5 (4.47) 


63.9 (3.90) 






175 7 f8 30) 


177 5 


52.8 (4.29) 


19.3 (1.38) 






119 3 18) 

117.-' ^o.Atjy 


176.5 


54,4 (4.61) 


41.2 (2.70) 




C151 


119 8 (8.28) 


175.1 


58.8 (4.58) 


27.9 (2.98) 




T152 


116.4 (8.28) 


174.7 


62.5 (4.33) 


69.7 (4.27) 




S153 


118.0 (8.20) 


174.2 


58.4 (4.47) 


63.8 (3.93,3.87) 




L154 


124.3 (8.18) 


177.0 


55.2 (4.37) 


42.4 (1.61) 




V155 


122.4 (8.01) 


174.4 


59.9 (439) 


32.6 (2.06) 



Cy27.I (i.83);C5 24.9 (1.06) 
Cy 32.0 (2.38^.26);Ce 15.9 (1.73) 
Cy 37.3 (2.07) 

Cy 24.7 (1.59);C5 29.5 (1.72);Cs 42.3 (3.04) 



CS-(6.72);C£ 117^ (6.58) 

Cy 27.0 (1.87,1.64);C8 42.9 (3.16,3.10) 

C5 43.1(-) 

CS - (6.77) 

Cy25.6 (1.3 1);C5 25.2 (0.32); 19.6 (-0.09) 
Cy 24.8 (1.46);C5 28.9 (1.70);Cs 42.2 (2.97) 

Cy25.5 (1.60,1.43); C542.5 (3.05^.97) 
C5-(7.18) 

C8 133.0 (7.17);Cs 1 17.6 (6.91) 
Cy26.8 (1.76);C825.6 (0.74);22.4 (0.89) 

Cy22.9 (1.27);C522.4 (0.94);22.6 (0.33) 
Cy 2 1.6 (0.73) ' • 

Cy27.2 (1.97);C5 50.6 (3.67,3.50) - 



Cy 36.2 (2.28) 

Cy24.9 (1.44);C529.0 (1.71);Cs42.2 (3.00) 
Cy 30.9 (2.64) 

Cy24.7 (-);C527.1 (1.71);Cs 43.5 (3.25) 



Cy24.8 (1.46);C529.1 (1.71,1.84); 
Cs 42.2 (3.01) 



Cy21.7(1.24>- 

Cy 27.0 (1.61);C5 23.5 (0.86);24.9 (0.90) 
Cy21.0(0.94);20.4(0.94) 
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Table 1 continued 



PI $A 

r I JO 


ri 


177 0 

Iff ,KJ 




/ 




t7A T 


I f A 7CT\ 
JO. I ^H-.Z.^ 


C158 


* 120.2(8.31) 


174.1 


58.2 (4.45) 


A159 


1263 (8.32) 


177.3 


52.6 (4.25) 


H160 


117.9 (8.24) 




55.5 (4.62) 



32.2 (2.30,1.90) Cy27J (2.03,1.94);C8 51.1 (3.863.69) 

29.5 (2.06,2.00) Cy 33.9 (2.4 1) 

28.2 (2.88) 

19.3 (1.30) 

29.8 (3. 15,3.06) C52 1 1 9.5 (7. 13) 



Footnotes to Table S 1 

a In each column, !5 N and l3 C shifts are listed first, and the corresponding l H shirts are given in 
parentheses. *H and I3 C chemical shirts are reported relative to 3^trimethyIsily)propionic-d4 acid and 
15 N shifts relative to external liguid NH3. 




/ 
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Table 2 

Restrained Minimized NMR Coordinates for Free RGS4 



Pages 42-70 



The structural coordinates herein are deposited with Brookhaven Protein Database. 

(Brookhaven National Laboratory) 

Deposit No. 
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2.413 -2.623 1.00 0.59 

1.618 -4.135 1-00 0.56 

3.690 -4.348 1.00 0.63 
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4.006 -3.214 1.00 1.77 

2.628 -4.311 1.00 1.80 

4.531 -5.257 1.00 1.66 
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4.073 -6.228 1.00 2.08 

5.874 -5.261 1.00 2.39 

5.805 -5.699 1.00 2.84 

6.215 -4.282 1.00 2.78 
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